

1 CGGGCGCGGC GGCGGCGGCG GTGACAGCGG CGCCCGCGCC TCCCCGCGCG 

51 TAGGTGTGCG GCGCGCTCCT GGCGAGGACG GAGCGAGCAG ATCTCGCGTG 

101 CGCTCGCCGC CCGGCGCAGC CCAGCCCGGC CCCCGCCTGG CGCCGCGAGC 

151 CGAGGTGTCT CCCGCGCCCG CGCCCGTGTC GCCGCCGTGC CCGCGAGCGG 

201 GAGCCGGAGT CGCCGCCGCC CGAGCGCAGC CGAGCGCACG CCGAGCCCGT 

251 CCGCCGCCGC CATGGCCACC ACGGTGACCT GCACCCGCTT CACCGACGAG 

301 TACCAGCTCT ACGAGGATAT TGGCAAGGGG GCTTTCTCTG TGGTCCGACG 

351 CTGTGTCAAG CTCTGCACCG GCCATGAGTA TGCAGCCAAG ATCATCAACA 

401 CCAAGAAGCT GTCAGCCAGA GATCACCAGA AGCTGGAGAG AGAGGCTCGG 
451 ATCTGCCGCC TTCTGAAGCA TTCCAACATC GTGCGTCTCC ACGACAGCAT 
501 CTCCGAGGAG GGCTTCCACT ACCTGGTCTT CGATCTGGTC ACTGGTGGGG 
551 AGCTCTTTGA AGACATTGTG GCGAGAGAGT ACTACAGCGA GGCTGATGCC 
601 AGTCACTGTA TCCAGCAGAT CCTGGAGGCC GTTCTCCATT GTCACCAAAT 
651 GGGGGTCGTC CACAGAGACC TCAAGCCGGA GAACCTGCTT CTGGCCAGCA 
701 AGTGCAAAGG GGCTGCAGTG AAGCTGGCAG ACTTCGGCCT AGCTATCGAG 
751 GTGCAGGGGG ACCAGCAGGC ATGGTTTGGT TTCGCTGGCA CACCAGGCTA 
801 CCTGTCCCCT GAGGTCCTTC GCAAAGAGGC GTATGGCAAG CCTGTGGACA 
851 TCTGGGCATG TGGGGTGATC CTGTACATCC TGCTCGTGGG CTACCCACCC 
901 TTCTGGGACG AGGACCAGCA CAAGCTGTAC CAGCAGATCA AGGCTGGTGC 
951 CTATGACTTC CCGTCCCCTG AGTGGGACAC CGTCACTCCT GAAGCCAAAA 

1001 ACCTCATCAA CCAGATGCTG ACCATCAACC CTGCCAAGCG CATCACAGCC 

1051 CATGAGGCCC TGAAGCACCC GTGGGTCTGC CAACGCTCCA CGGTAGCATC 

1101 CATGATGCAC AGACAGGAGA CTGTGGAGTG TCTGAAAAAG TTCAATGCCA 

1151 GGAGAAAGCT CAAGGGAGCC ATCCTCACCA CCATGCTGGC CACACGGAAT 

1201 TTCTCAGTGG GCAGACAGAC CACCGCTCCG GCCACAATGT CCACCGCGGC 

1251 CTCCGGCACC ACCATGGGGC TGGTGGAACA AGCCAAGAGT TTACTCAACA 

1301 AGAAAGCAGA TGGAGTCAAG CCCCAGACGA ATAGCACCAA AAACAGTGCA 

1351 GCCGCCACCA GCCCCAAAGG GACGCTTCCT CCTGCCGCCC TGGAGCCTCA 

1401 AACCACCGTC ATCCATAACC CAGTGGACGG GATTAAGGAG TCTTCTGACA 

1451 GTGCCAATAC CACCATAGAG GATGAAGACG CTAAAGCCCG GAAGCAGGAG 

1501 ATCATTAAGA CCACGGAGCA GCTCATCGAG GCCGTCAACA ACGGTGACTT 

1551 TGAGGCCTAC GCATTCTACT TCGAGAACCT GCTGGCCAAG AACAGCAAGC 

1601 CGATCCACAC GACCATCCTG AACCCACACG TGCACGTCAT TGGAGAGGAT 

1651 GCCGCCTGCA TCGCTTACAT CCGGCTCACG CAGTACATTG ACGGGCAGGG 

1701 CCGGCCCCGC ACCAGCCAGT CTGAGGAGAC CCGCGTGTGG CACCGCCGCG 

1751 ACGGCAAGTG GCAGAACGTG CACTTCCACT GCTCGGGCGC GCCTGTGGCC 

1801 CCGCTGCAGT GAAGCCAAGG GAGGGGCACA GAATGGGGAA CAGGACACAG 

1851 GATCCTAAAC TCCAAGGGGA CTGTCCACCG ATGAACACTC AGAGTGGACA 

1901 CCATCTTCCG TCCACGCTGT GCCCAGGACA GCTGTCCCCA TCCATGAACA 

1951 CAGGGTAAAC ATCTGCCGGG CTCCGCACCA GTGGCTCCCT GGGCCATGGG 

2001 ACAGCGGCAG GGCTCACCAC GGACAGCACG TGGCCCAGCA GCCGGCCACC 

2051 CTGGCGTCCT GGGGCCTCCT CCCCTCCTCT CCCTCTCACC TTGTCACCTC 

2101 CACGGAGCTG CCTGTCTGGG ATAATTTGGG GATTTTTTTT TCTGGGGGAT 

2151 AATTCTTTTG CATGACCCCT AAAGAGCAAG CCACACCGGT CTGCTAGCTA 

2201 GGTGTCCGCG GTGTGGTG (SEQ ID NO:1) 



FEATURES : 



5'UTR: 1-261 

Start Codon: 262 

Stop Codon: 1810 

3'UTR: 1813 
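
The feature coordinates above can be cross-checked against the listing itself. The following is a minimal sketch, not part of the original figure: the helper name `codon_at` is hypothetical, and the two rows are copied from the rows of SEQ ID NO:1 that begin at positions 251 and 1801. It reads the codons at the stated start and stop positions and derives the length of the encoded protein.

```python
def codon_at(row_start, row_text, pos):
    """Return the 3-nt codon beginning at 1-based position `pos`,
    given one listing row whose first base is at position `row_start`."""
    seq = row_text.replace(" ", "")
    offset = pos - row_start
    return seq[offset:offset + 3]

# Feature coordinates as stated in the FEATURES block.
START, STOP = 262, 1810

# Rows copied from SEQ ID NO:1 (positions 251-300 and 1801-1850).
row_251 = "CCGCCGCCGC CATGGCCACC ACGGTGACCT GCACCCGCTT CACCGACGAG"
row_1801 = "CCGCTGCAGT GAAGCCAAGG GAGGGGCACA GAATGGGGAA CAGGACACAG"

start_codon = codon_at(251, row_251, START)
stop_codon = codon_at(1801, row_1801, STOP)

# Coding region runs from the start codon up to (not including) the stop codon.
coding_nt = STOP - START
n_residues = coding_nt // 3

print(start_codon, stop_codon, coding_nt % 3, n_residues)
```

Under these assumptions the coding region is a whole number of codons, and the residue count agrees with the length of the protein listed as SEQ ID NO:2.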
Homologous proteins: 

Top 10 BLAST Hits 



CRA|18000005245285 /altid=gi|5326757 /def=gb|AAD42035.1|AF07880.
CRA|18000005199792 /altid=gi|10835006 /def=ref|NP_001211.1|cal.
CRA|18000004938668 /altid=gi|6671660 /def=ref|NP_031621.1|calc.
CRA|18000004937301 /altid=gi|11120682 /def=ref|NP_068507.1|Ca+.
CRA|18000005245287 /altid=gi|5326762 /def=gb|AAD42037.1|AF08192.
CRA|18000005171302 /altid=gi|3668373 /def=gb|AAC79460.1|(AF085.
CRA|1000737074531 /altid=gi|6688228 /def=emb|CAB65122.1|(AJ252.
CRA|18000005245288 /altid=gi|5326764 /def=gb|AAD42038.1|AF08341.
CRA|18000004964693 /altid=gi|466360 /def=gb|AAA81938.1|(U06636.
CRA|18000005199791 /altid=gi|4139268 /def=gb|AAD03743.1|(AF112.

BLAST dbEST hits: 




gi|12801212 /dataset=dbest /taxon=960.
gi|12868201 /dataset=dbest /taxon=960.
gi|2053138 /dataset=dbest /taxon=9606
gi|10213950 /dataset=dbest /taxon=96..
gi|9324431 /dataset=dbest /taxon=960..
gi|12921378 /dataset=dbest /taxon=960.
EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 
From BLAST dbEST hits: 
gi|12801212 Fetal brain 
gi|12868201 Fetal brain 
gi|2053138 Testis 
gi|10213950 Lung small cell carcinoma 
gi|9324431 uterus endometrium adenocarcinoma cell line 
gi|12921378 Fetal brain 

Tissue expression from PCR-based tissue screening panels: 
hippocampus 
1 MATTVTCTRF TDEYQLYEDI GKGAFSVVRR CVKLCTGHEY AAKIINTKKL 
51 SARDHQKLER EARICRLLKH SNIVRLHDSI SEEGFHYLVF DLVTGGELFE 
101 DIVAREYYSE ADASHCIQQI LEAVLHCHQM GVVHRDLKPE NLLLASKCKG 
151 AAVKLADFGL AIEVQGDQQA WFGFAGTPGY LSPEVLRKEA YGKPVDIWAC 
201 GVILYILLVG YPPFWDEDQH KLYQQIKAGA YDFPSPEWDT VTPEAKNLIN 
251 QMLTINPAKR ITAHEALKHP WVCQRSTVAS MMHRQETVEC LKKFNARRKL 
301 KGAILTTMLA TRNFSVGRQT TAPATMSTAA SGTTMGLVEQ AKSLLNKKAD 
351 GVKPQTNSTK NSAAATSPKG TLPPAALEPQ TTVIHNPVDG IKESSDSANT 
401 TIEDEDAKAR KQEIIKTTEQ LIEAVNNGDF EAYAFYFENL LAKNSKPIHT 
451 TILNPHVHVI GEDAACIAYI RLTQYIDGQG RPRTSQSEET RVWHRRDGKW 
501 QNVHFHCSGA PVAPLQ (SEQ ID NO:2) 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

Number of matches : 3 

1 313-316 NFSV 

2 357-360 NSTK 

3 399-402 NTTI 

[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

Number of matches : 2 

1 48-51 KKLS 

2 259-262 KRIT 

[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches : 4 

1 47-49 TKK 

2 51-53 SAR 

3 358-360 STK 

4 367-369 SPK 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

94-97 TGGE 

109-112 SEAD 

262-265 TAHE 

400-403 TTIE 
[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

9-17 RFTDEYQLY 
[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 3 

1 302-307 GAILTT 

2 332-337 GTTMGL 

3 390-395 GIKESS 

[7] PDOC00100 PS00107 PROTEIN_KINASE_ATP 
Protein kinases ATP-binding region signature 

20-43 IGKGAFSVVRRCVKLCTGHEYAAK 

[8] PDOC00100 PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active-site signature 

132-144 VVHRDLKPENLLL 
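
The short-motif matches listed above can be reproduced by scanning the protein with regular expressions. The sketch below is illustrative only and is not the tool that produced the listing. It assumes the PROSITE patterns N-{P}-[ST]-{P} (PS00001) and [ST]-x-[RK] (PS00005) translate to the regular expressions shown; the function name `scan` is hypothetical; and `PROTEIN` is SEQ ID NO:2 with residues 27-28 and 132-133 read as VV, which is what the corresponding codons in SEQ ID NO:1 (GTG GTC and GTC GTC) encode.

```python
import re

# SEQ ID NO:2, 516 residues, written as eleven rows of up to 50 residues.
PROTEIN = (
    "MATTVTCTRFTDEYQLYEDIGKGAFSVVRRCVKLCTGHEYAAKIINTKKL"
    "SARDHQKLEREARICRLLKHSNIVRLHDSISEEGFHYLVFDLVTGGELFE"
    "DIVAREYYSEADASHCIQQILEAVLHCHQMGVVHRDLKPENLLLASKCKG"
    "AAVKLADFGLAIEVQGDQQAWFGFAGTPGYLSPEVLRKEAYGKPVDIWAC"
    "GVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDTVTPEAKNLIN"
    "QMLTINPAKRITAHEALKHPWVCQRSTVASMMHRQETVECLKKFNARRKL"
    "KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKAD"
    "GVKPQTNSTKNSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANT"
    "TIEDEDAKARKQEIIKTTEQLIEAVNNGDFEAYAFYFENLLAKNSKPIHT"
    "TILNPHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKW"
    "QNVHFHCSGAPVAPLQ"
)

# PROSITE patterns rewritten as regular expressions (assumed translations).
PATTERNS = {
    "PS00001": r"N[^P][ST][^P]",  # N-{P}-[ST]-{P}, N-glycosylation site
    "PS00005": r"[ST].[RK]",      # [ST]-x-[RK], protein kinase C site
}

def scan(pattern, seq):
    """Return (begin, end, peptide) for every match, 1-based and inclusive.
    A lookahead is used so that overlapping matches are all reported."""
    hits = []
    for m in re.finditer(f"(?=({pattern}))", seq):
        peptide = m.group(1)
        begin = m.start() + 1
        hits.append((begin, begin + len(peptide) - 1, peptide))
    return hits

for name, pattern in PATTERNS.items():
    for begin, end, peptide in scan(pattern, PROTEIN):
        print(name, f"{begin}-{end}", peptide)
```

The same function could be applied to the other short patterns in the list, provided each PROSITE pattern is translated with equal care.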



Membrane spanning structure and domains: 

Helix Begin End Score Certainty 

1 195 215 1.665 Certain 

2 319 339 1.301 Certain 
BLAST Alignment to Top Hit: 

>CRA|18000005245285 /altid=gi|5326757 /def=gb|AAD42035.1|AF078803_1 

(AF078803) calcium/calmodulin-dependent protein kinase II 
beta subunit; CAM2 [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=542 
Length = 542 

Score = 1047 bits (2678), Expect = 0.0 

Identities = 516/542 (95%), Positives = 516/542 (95%), Gaps = 26/542 (4%) 
Frame = +1 

Query: 1    MATTVTCTRFTDEYQLYEDIGKGAFSVVRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 180 
            MATTVTCTRFTDEYQLYEDIGKGAFSVVRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 
Sbjct: 1    MATTVTCTRFTDEYQLYEDIGKGAFSVVRRCVKLCTGHEYAAKIINTKKLSARDHQKLER 60 

Query: 181  EARICRLLKHSNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 360 
            EARICRLLKHSNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 
Sbjct: 61   EARICRLLKHSNIVRLHDSISEEGFHYLVFDLVTGGELFEDIVAREYYSEADASHCIQQI 120 

Query: 361  LEAVLHCHQMGVVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 540 
            LEAVLHCHQMGVVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 
Sbjct: 121  LEAVLHCHQMGVVHRDLKPENLLLASKCKGAAVKLADFGLAIEVQGDQQAWFGFAGTPGY 180 

Query: 541  LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 720 
            LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 
Sbjct: 181  LSPEVLRKEAYGKPVDIWACGVILYILLVGYPPFWDEDQHKLYQQIKAGAYDFPSPEWDT 240 

Query: 721  VTPEAKNLINQMLTINPAKRITAHEALKHPWVCQRSTVASMMHRQETVECLKKFNARRKL 900 
            VTPEAKNLINQMLTINPAKRITAHEALKHPWVCQRSTVASMMHRQETVECLKKFNARRKL 
Sbjct: 241  VTPEAKNLINQMLTINPAKRITAHEALKHPWVCQRSTVASMMHRQETVECLKKFNARRKL 300 

Query: 901  KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 1080 
            KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 
Sbjct: 301  KGAILTTMLATRNFSVGRQTTAPATMSTAASGTTMGLVEQAKSLLNKKADGVKPQTNSTK 360 

Query: 1081 NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 1260 
            NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 
Sbjct: 361  NSAAATSPKGTLPPAALEPQTTVIHNPVDGIKESSDSANTTIEDEDAKARKQEIIKTTEQ 420 

Query: 1261 LIEAVNNGDFEAYA--------------------------FYFENLLAKNSKPIHTTILN 1362 
            LIEAVNNGDFEAYA                          FYFENLLAKNSKPIHTTILN 
Sbjct: 421  LIEAVNNGDFEAYAKICDPGLTSFEPEALGNLVEGMDFHRFYFENLLAKNSKPIHTTILN 480 

Query: 1363 PHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAP 1542 
            PHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAP 
Sbjct: 481  PHVHVIGEDAACIAYIRLTQYIDGQGRPRTSQSEETRVWHRRDGKWQNVHFHCSGAPVAP 540 

Query: 1543 LQ 1548 
            LQ 
Sbjct: 541  LQ 542 (SEQ ID NO:4) 
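
The summary figures in the alignment header can be checked with a few lines of arithmetic. The sketch below is an illustration, not the BLAST implementation. It assumes the Query row beginning at 1261 carries a 26-residue gap, consistent with that row spanning only 102 nucleotides (34 residues) against 60 Sbjct residues, and it assumes the reported percentages are truncated rather than rounded. The function names are hypothetical.

```python
def alignment_stats(query, sbjct):
    """Count identical columns and gapped columns in one aligned block."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    gaps = sum(1 for q, s in zip(query, sbjct) if "-" in (q, s))
    return identities, gaps, len(query)

def percent(n, total):
    """Integer percentage, truncated (assumed to match the header figures)."""
    return n * 100 // total

# The one block that contains the gap (Query 1261-1362, Sbjct 421-480).
query_block = "LIEAVNNGDFEAYA" + "-" * 26 + "FYFENLLAKNSKPIHTTILN"
sbjct_block = "LIEAVNNGDFEAYAKICDPGLTSFEPEALGNLVEGMDFHRFYFENLLAKNSKPIHTTILN"
block = alignment_stats(query_block, sbjct_block)

# Whole-alignment totals as stated in the header.
summary = (percent(516, 542), percent(26, 542))

print(block, summary)
```

Every other block in the alignment is identical column for column, so the single gapped block accounts for all 26 gap positions, and 516 identities plus 26 gaps cover the 542 aligned columns.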
Hmmer search results (Pfam): 

Parsed for domains: 

1 GAGCTGCTGT GTCTCTGTCC CCAGGGGCAG AGGGGCTGTG GGGTTGCAGG 
51 CTCAGCGTCT GGGACTCTGG GGTGAAGGCT CAGCCATGCC CTGCAGACAC 
101 CATGGGGCAG GGCTCAGACC TGTGCACCTG TCTCTTGCAA ACCACTGTTT 
151 TCTCTGTTTT GTAACCCCCC ACCCAACCCC ACATAACACC TCTGGGTTTA 

201 AACAACATGC ACCCTTGTGC CGGTCACCTC CCTGCAGCCG GAGAACCTGC 
251 TTCTGGCCAG CAAGTGCAAA GGGGCTGCAG TGAAGCTGGC AGACTTCGGC 
301 CTAGCTATCG AGGTGCAGGG GGACCAGCAG GCATGGTTTG GTGAGTGCCA 

351 GGGGCAGGGT GTGTTGGCTG GCAGTTGGCA GGGCAGGAGG TGATGCTGAC 

401 AGCCCCTTGT GGCCTCTTCC CCTCTCTCTA GGTTTCGCTG GCACACCAGG 
451 CTACCTGTCC CCTGAGGTCC TTCGCAAAGA GGCGTATGGC AAGCCTGTGG 
501 ACATCTGGGC ATGTGGTGAG GCCTGGCCTG AGTTGGTGCG GGGCAGGGCC 
551 TCGGGTGTTT CAGGACTTCC CACCTACATC CTGGAGTGTG CAGTGGCCAG 
601 CACGTCTTGC TCTCATCTGG GTTTATCTGT GTCAGACCTG CCCTTGAGCT 
651 GCCCTGGCAG GGGTCTGCCC ACACAGCCAA GAGCCCCCTT TCCACCCAGA 
701 TTAGAATTGC TCACATGAAC CTGGCGCACC CCAGTGCTCG CCTGCGCTCA 
751 GCAGAGGTCT GGTCCAGAAG TGTGGTGGGT GGATGGGAGT GGAGAAGAGA 
801 GGTCAGGGGC TGTTGGGCCA TGGGCAGGGC CACCTCCTTG GGTAGGGGTC 
851 TCCTCCCACA GAGGTGGGGA GCAGCAGAGG GGCTTGACAT CACCCTCATC 
901 CCTGTGATAG TGTGGGTGTG GGGCAGAGGT CAGGGGGCCG GCTGTGCCCT 
951 TCTACCCCAG TGTCTGCTGC ACAGGTGGGG GCAAAGGAAT GCTGAGGACC 

1001 CCAATGCCCT CCCAGGGCCA CAGGAGCTAG GCAGTGAGGG TGCAGGGCAT 
1051 GGGCTTCATG GACGGTGGCA CCCTGCAAGT GGCTGCGGTG CTCACAGGCC 
1101 CCATCCGCAG GGGTGATCCT GTACATCCTG CTCGTGGGCT ACCCACCCTT 
1151 CTGGGACGAG GACCAGCACA AGCTGTACCA GCAGATCAAG GCTGGTGCCT 
1201 ATGACGTGAG TGCACCAGCC CCTCTCTGAT GAGCTCCCTT CCTCCAGGTG 

1251 TGGCCGGGTG AGGGCAGCGT GGGAAGAGGC TAGGAGTGGG GTGAAGCCAC 
1301 CTGTGGCCAG GTCCTGGGTC CTGCTCTCCC AGATTCGTGG CTGGAGATGA 

1351 AGCCCCTTGG AGAATTCTTG CCCCTGCCTG AGAGGGAGCT TCAGGCCCGG 

1401 CCGGGGCGCT GTTTCCTTCT GCAGTTCCCG TCCCCTGAGT GGGACACCGT 
1451 CACTCCTGAA GCCAAAAACC TCATCAACCA GATGCTGACC ATCAACCCTG 
1501 CCAAGCGCAT CACAGCCCAT GAGGCCCTGA AGCACCCGTG GGTCTGCGTG 
1551 AGTCGCCCTT GGTGCCCATG GTGGGGAGGG GGCTCCTGGT GGAGATGGCC 
1601 TCAGACCACT CCCCTGGCAA GGACCCCAAG AGGGTCCTGT TCCTGACATC 
1651 CAAGAGCTCC CTTGGGTCCC CTGGGTGCTC CTTGTGGCCT CTGGCTTGGG 
1701 ACATACCAGC ACGTTTGTGA GGCCTGGGGC TTGGAAGGCA TTAGAGGGTA 
1751 GAGGTGATCC CTTCCTCCCA ACTGCAGTCC TGTCTGTGAG GGGCAGAGTG 
1801 GACGAGGCAA GGGAGAGACG AGTCTTGAAG TCCCAGGCGG GTGGGGACAG 
1851 ACAACCCTTG CCGCAATGGT GGCCGGTGGC TCTTGGCAAG TGGGGACCCC 
1901 AGGGTGCCAC AAGCCTTGCC ACCCTGGCCT CTCCCCTGTG CCTCGGGCTC 
1951 GGCTGCCATA TGACCACCCA TTTCCCCACA GCAACGCTCC ACGGTAGCAT 
2001 CCATGATGCA CAGACAGGAG ACTGTGGAGT GTCTGAAAAA GTTCAATGCC 
2051 AGGAGAAAGC TCAAGGTGAG GCCCTGGCCC CTAGTCCCAG GCACGGCCAT 
2101 GCTTCTCTGT GTCCCTCTGG GCTGGAGCAG GGGGGCCTTG GGGGGTCTGG 
2151 GCAGACCTAG GGGTTACTGC TGCCCCCAAG ACTGACTGTT AGCAAGTCCC 
2201 AGACTGGATG CATCAGGTGA ACTCAGGCCA GCTTGGGAAT GAGTCCAGAG 
2251 GGGCCCTGGG CCAGGTGTGG CTCCTCCTAG TTGTCTGTGC CACCTCCTAG 
2301 CAGCCCTTGG AGGAGCTGTC CTGAAGCGCT CGCTGTGGGC TCCTCACCCG 
2351 GGCTCTGCAG GCAGCACTCA CCCTCTGGCA GTCACACTGT TTAGTACAAG 
2401 CAAGTCCGAA GCTTCCGGCT CAGACAGGTT TGGTAAGGAG AGCAGAGCCA 
2451 CACACACTGG TCTTGGGTGG GCTGGGGGAG TTCTGGGAGG GAGGTGGGTC 
2501 CCAGTAGGGT ATCCAACCTG CCTGCTTTGG TCAGGGCTGG CTCCGGTGAC 
2551 CGCACACTGG CAGTCCCTCT ACTTGTGGGT TCCGGGATGG GGACTTGTTG 
2601 CCTGACTGCC CTCTGCTGGT CTCTGAGCAG TTCTCCCCGG AAGCCCCAGG 
2651 ACTGTTGCCC TGTCTGAGCC TGTCAGGAAA AGAAGGGGCT GTCAGGGAGC 
2701 TGGACCCCAG AGGAGCTGCC GTGGTGACCA GCTGTTCTGG TGACCCCTGA 
2751 GGCTTGAGGG GTCTTGAAGC AGCTAGAAGC TGTAGTTGGT CAACAGGTTT 
2801 AGGCCCAGGG TGTGTGTAGT TCTGGAAATA GGTGATCTGT CTCAGTGCGG 

2851 CTGCTGGCTT CCTGGAGCTC TTGCCTCTCT GGAAGGCTGA GGTCATGTCA 
2901 GCCTCATGAC AATGAGGCTG AGCATCTGGG CAGGAGGACA GGGGTCTTAT 
2951 CCTGGCCAGA AGCCAGCAGG GAACACTGAT GGGATAGCCC CGGTTTTATC 
3001 TGTGTCTCTC CCCAGGGAGC CATCCTCACC ACCATGCTGG CCACACGGAA 

3051 TTTCTCAGGT GAGCCTTTCT TCTCCAGGGA GACAGGCGCT GCCCCCTCCC 
3101 TGCTGGCCCA CGCAGGAGAG CGCCTCCTTC CTCACCAGCC TCTCCACTCC 
3151 TCCTCTGCGG CAGGCCTGCC CTCGGCGTCT GCCCTCAGCT CTGAGACCCA 
3201 CTGCCCACCT GGCCCCGCTG GGCTCCCACC TTGGGTGATA CCACAGGGTC 
3251 CAGCCCCCCG AGGCCATCAC CTTCGTGCTG GGTCTGTGTC CCTCCACCCC 
3301 CTGAACACGA GCGTCTGTGC TGCCCCACTG GGGCTCACAG CATCGTGTGT 
3351 GTCTGTCCAG GCGTTTGTCG GGCATCTATG TGGCCTCCTT GTCATTTTGA 
3401 GTGCTCTGAA CATTGTGTTT TGTGCGGGAG GTGGGCAGAA GGGATGCGGG 
3451 GTGATGCGGG AGGCTCGGGG GCCTCCTTCC AAGTTCTGGA TGAGCTGCAG 
3501 CCTCCTGTCC CGGCTGCTCA GGGTGGGTGG TTGGGAAGCA AGTTCTCTTG 
3551 GCAGGGGGGT GGGGTCTGTT ATAGACCCCT GAGGCCCAGG GCGCTGGCAG 
3601 ACCCATCGGG GCATGATGTT AGCCCCGGAG TGGAGCCGGC AGCCCAGGTC 
3651 TGGACAAGCT GTACCTGTGG CTTCTCCGTC GTCCGACACT CCGTGTGCGA 
3701 GCGTCTGTGA TCCGTCTCTC TCGTTGTCCG TTTGCATCTG GTGCCCCCCA 
3751 CCCGCCATCC TGTTACTTTT GCTGTGATGC TGTAATGCCG GGAACGCGTG 
3801 CACACGGTCA CACCAACACT AATAGGACTG TCCTGTCTGC TGTGTGCTCA 

3851 CCACACCCTT TGGGCATGAG AAGCCCCCAC TGGGGTTTTC TAAGGAGAAA 
3901 GGAGGCAAAT GCTTTTCCGT GTCAATCAGT CCAATCTTGT TTTCACTCTC 
3951 TTGAGCAAAG GATTCTGGAA CCATCTGTCA CCTAAACTTT AACTCTAATC 

4001 TTCTTCTGCT TCCTTTGTCT CTTTTCTTCC CTTACCTCGC CCACCCCTCG 
4051 TCTGTGTCCG CCCACCCCTC CCTTCCCCTC GTCTCTAACC CGGTGCTAAC 
4101 AGTGGGCAGA CAGACCACCG CTCCGGCCAC AATGTCCACC GCGGCCTCCG 
4151 GCACCACCAT GGGGCTGGTG GAACAAGGTA GATGTGTCTC GACCAGCGTC 
4201 CCGCCCGCTC CCGCCCGTCC CTCCTGCCAG CATGCAGCCC CCTGCTGCAC 
4251 GCAGCCGCTG GCCGGGCTCC AGAGCCGCCC CAGAGGCCGC CAGGCCCCCG 
4301 GGAGCCCCTG CTCCCGTGTG GTCACATCCC AGCAGAGCCC ACCACAAGGG 
4351 CAGGGAGGCA GCCCCCAAGG CTCCTCGCCT GTAAGAGGAG GGGCTGGGCT 
4401 AGGTGGCCCC TGGGCTACAC CAAGCCCTTC TGGTCCTGGC CCCCGAGGTC 
4451 TGGGGGTCCG GAGACCCCCA TTAAGAATGG CCTGGGCCCC ACAGGGAGCC 
4501 ACTGGGCCTG CTGCTGGGGG GTCTGAATCC TGAAAGGAGA GCCTTGAGGA 
4551 GCAGAGCCAG AGAGGCAGAG GCCCTTGGGG CAGACACACA CCCTGCCCCT 
4601 CTGGGGCCGC ATGGAGACGG TGGTCTGTGC TGCTGAGTCC TACACATGCA 
4651 TGTCTGCCCT GAGCATCCCC CCAGGACAAG CCGCTCTGGA GTGGGTGAGG 
4701 GTTTTATGCA CCCTGAGGAG ACTTTCAAGG CTTCCTCTTG GGTTGTTTCT 
4751 GCAAAGTCCT CCTCCCCTGG CCTCAAACCC TGTGAGGGAA AAGGCCGGCA 
4801 CTGGCCACCT GCTCCTCTGG GCTGTGCGGG GCCAGAGCCC AGAGGCCCAA 
4851 GTTGGCTTCT GCCCACCTGC TGGCTTGTGA CCATGGGCAG ACCCCATGAG 
4901 GGCTAGGCGA CCCCAAGACC TCCTTGCAGC TCCAGCCTGA GCTGAAGGCT 
4951 GGTGAGAGCT TAGGGCAGGC CAAGCTGACA ACGCCTGGCC ACAGAACACA 
5001 GAGGGCTACA GGGGTGACCC CAGATCCTCC CTGGGCTGAG CTGCTGAGTT 
5051 CCCTGTCGGT GCCTCCAACG TGGGCTGGGG ACCCGGCAGA GGTTCCAGGG 
5101 TGCTGGAGAC TGCCTTCCCC AGGCCTCCTC ATGACCCACA GGGTGAGCAG 
5151 CCTGGCCTTC CCAGCCAGAG AACCCTCCTT CTGGGGAGGC CCAGGGCGTC 
5201 CTCGGGGAGG GCAGTCTATT CTCCTCCCAT GAGCCCAGTG GACGTGTCTA 

5251 GCAGGCAGCA CCCCGGGAGA GCCCTCCCAC GTCTTCTCCA TTTGACAGGC 
5301 CTTTCCAGAG CGCAGGCGGG AGGGGGCTGT GATTAGAAAA GAGTGAGGCT 

5351 AGTGGCTTCT GGGGAGGCAC TGCTGCCCAG GGGACAGTGC TGAGAGACAG 
5401 CTGCCTCTAC GCTGCCCTGT GCCCGGGGCT CCCGCTGCAA TGCCCGCCTG 
5451 TCTGCAAGTG AACGTGGGGC GACGGTGCAT GAGGCCCTGC ATGTGTGGCT 
5501 CCACCCTGGG CGCCGAGAGC AGCTCTGTCC TGGAGGGTGG TCAGTGCATG 
5551 TGGACAGAGC CCAGCATGGC TGTCCTGGGT GACCAGCTAA GGGGACAAGG 
5601 CAGAGGCAGG GCTGAGAGGA CCACCCATCC TGCTAGGTCA GCCCAGCTCA 
5651 GCCATATCAC ACGGCAGTGA GCATGGAGCT CAGTTCTCTG CCAATGGCAG 
5701 CTGAGTCTAG TACCATCCAG TCAGAGTCTG GTACCAGCCC ATGTGGCATA 
5751 GCCCCCTCGG CCCGCAGAGA GACCCCGTCT GTCGAGTGTG CTTCAGTTTG 
5801 GCCTCTGTGG TCTCTCCTGC ATTGATCAGG TGTAAGGGCA TAGGAGACCC 
5851 AGTGTCCGGC CAGCTGCAGG GTGGCAGCAG TTGCCCCGGC CTGGAGACCC 
5901 GGGAATGGGC AGTGCCTTCC CAGGATGGAG GGCAGAGGGT CTCTCCTTGT 
5951 CCCACAGAGG CCTGCAGAAC CCCCAACCCA GGTGTCTGAG ATGCCTGTGA 
6001 CTGCTCCGCC TACCCTGGGC TCCTGCGGCA CCTAACGCAT GCTTTGAACT 
6051 TGAGACACAG AAAGGAAGTT CCCGTGCCCT TGAATGCTAG TGTAGATGGG 
6101 CATCGACAGG ACTCTGGCCA CGGTGAATCT GGAGTTAGTC CCAGGCAGAG 
6151 ATGTGAAATG AGCAGCCCCC CAAAAAATGG TTGGCCGGGA GCCATGCACT 
6201 CAGGAGGGCC GGGCCCATGC ACCCCACACT GCGCCCAAGG CGTGCACAAG 
6251 CGATTGTTTT AAAAGCGGGT TCACAAGGAA GGATGTTTGG GAACTGACTG 
6301 AGACAACAGG GACGTCTGCT GCAGGGCTTC CCAGAGCTCT GATGGCAGCG 
6351 TCGGCCTGAG TCCTTCGAGG AGGGCTGGTT TGTACGTGGC ATTTGCTGCC 
6401 CACTGGACTG TGAACTTCTG TCTTTTTATT TCCCACTGCT GCTGTGGTAC 
6451 ATCTCCAGTA GCATAGTTTG GAAATGCAGG TTTTGATAGA CTCAAGGATC 
6501 TAAATAGAAC CCTCTTAGTA CCAAGGACTG TCCGGGGTCT CTGCCAGCCC 
6551 CGCCGATGGG CCTAACTGTG GTGCCTCCTT TCCTGTGAGA ATCTTCTGAG 
6601 GACATGCCCG GGGAAAGAGC TCAGTTCTGC TGCTGCCTAG GGTGCCATGC 
6651 TGGCCCCGGT TCCAATGCAG AGCCTAGCTG GAAGTACCGC TGGGTTGGCG 
6701 GAGGCTACGT GCCTGACTGT CCCCTCGGGG GTGGGGTGGA ACTAGCCTTC 
6751 TGAAACCGCC TGCTTCAGTT GGCCACAGCT TTTTGAAATG TGTGTTTCTG 
6801 GAAGGGACTG GGTCCCTTCC TTGCCTGTTC AGCTCCCCAC GACAAATGTC 
6851 CTCAAGGCGA GGCTGGATGC TTCCTTCCTC AGGCTCCTAG GAGGAGCCCG 
6901 TCCCCCAGCT GTGTCGGGCA GCTGGTCACC AGCAAGGACA GGATCCCTCA 
6951 GCTGCAGCCT CAGGCTGGCT GGCACTGGGC GGGTGTTTCT GGGATGAGTT 
7001 GTGTGTACTG GAGATGGGAG GGGAGCTGAG AGGGTGGGAT GCACAGACAG 
7051 GAGAGGGGAC TGTGGGGGTC CTGGAACCCT GAGTTCCAAG TCTTCAGGAC 
7101 TCTCCCTCCA TAGCAAGTTA CAGGGAAGCA GATTTGAGCC ACAGGGAAGC 
7151 AGATTTGAGC TGCAGCGAGG GGGAGGGTTT TCAGTCTGTG CTATAGGGAA 
7201 GTGGGCAGTC GGCATTTCTG GTCCTGGGAA CTCACTGGGC AGGGCTGCCT 
7251 TGGGACATCA GGGAGGTGGC GCTGTGCTCA GCTTCACCAG GAGGGGCCTT 
7301 AGGCCTGGGG ACGGAGAGTG ATGCCTGAGG CCCCTCTACT TCTCCATGGA 
7351 TCCTGGGAGG GACTCCTGGG CTGGATACAA AATTGTTGAG AGTTAAGAGA 
7401 TCTGTGAGGA AGGGGAGGCT GGGAATAGAA AGTGTGTGCC CACTGCACAT 
7451 GGGGTCCGCA GGGCCACGTG CAGCCACTGC GCAGGCACAA CCCCAGTCCC 
7501 CACAGAGCCC AGGAGGGGCC AGAGCCATGG AGGAGGCAGC ACTGGGCATT 
7551 TGGACAGGGA GGGGGTGGTC AGCAGGCAGC AGGCCCAGGC CTGTCTATGC 
7601 CCTGCGGGGT GCAGCCTCCT GATCTCCACG GCAACCTGGA GCACCCAGCG 
7651 TCAGAACCAC CGGGAGGGCT TATGGAACAG ATGTCCAGCC CTGCAGAAGT 
7701 TCTGGCTCAG GAGGGCGGGG TGGGCCTGGG AATTTGCATT TCTGACTGTA 
7751 CAGGGCGATT CTGCTGCTGC TGCTGCTGCT GGGGTTGGGG GAGGATCCCA 
7801 TTTGAGAAGC GCTGCAGTCC TAGGTTGAAA CGTGCCTGTC TGTCCCCACC 
7851 CAGGCCTGCA TGGGCAGCAC GGGATCCCCA GGCAGGAGGA CCCAATTTCA 
7901 TGGCCTGGCC AGCCAGGGTC CTGGAGCCAG GCGGTGGGGG AGGGATGGGG 
7951 GATTGCTGTG CCACCTTCCT TCCCGGCTTG GCCCGGGGGC AAGCATCCTC 
8001 ACACTTCCCA TGTCGTCATC CCCTTGGCTC CAGCCTGGCT GCCTCTCTAA 
8051 CCCTGCTGTA CCGGCTGGCC GCATGGCCCT GGCTCTTTTT GGTGAGCGTG 
8101 GTCCAGGACT GGTGACCTGT GAGTCCTGGG CCCGCAGTCT TGCGCCCCTG 
8151 CCCGAACCAA CACAAATCTT GTTTTCTCTC TCTCTCTTCC TTCCTCACTC 
8201 CCTCCCCTTC TCACCTTTCC TTTTCTGTAA GGTAAGCTGA CTTCCTCTTT 
8251 TGGTTTTTTA TTTATTTTTA TTTTTTAGTT CTGTAATTAA AATCCTAACA 
8301 GCCATGGAGG GTGTGGGCAC CGGGGGCTGG GGCCAGGCCC CTCTGACCTC 

8351 TGAGGGGGAA TGCTGGGTGA GGCAGGGGCC CCGCTGCTGG GACCAAGTAT 
8401 CCTCAGGGGC TTGTGGGCAG AAAGGCCTGT GCTGGCCCCA GTCAGTGCAC 

8451 AGAAGCGGCC CCAAGGCCAG GGCTGCTGGG CAGCTCGGAA TGAGGGCGAG 
8501 CAGGGCTGCC CTTGGTGCCT GAGCCAAGGA GCCAATGGGA CAGACCTCTG 
8551 AGCCTGGGTG CCAAGTATGA GGTCTGAGAC AGGGTGAGCG CCTGGGCTGG 
8601 GACAAGGCCC TCTGAGTGGG CGGCCAGCTG CAGCCCACCC ACCCCTACCC 
8651 CAGGAAGGCA GGGCCCGGGA GGGCATGACC TCTGGGGTGC TGGCTCAGCT 
8701 GCCCCCACCC CAACCTGACA CCGCTAGTCC TGAGTTCCCA TCAGGGAGGA 
8751 AGCAGCATCC TGCCTTCCTC TAGGAAGAGC TTGCATGTGG CCCAGAAGCC 
8801 AAGGGGGCTC CCCAGCACCC ACGGGCATCT CTGGGTCTGG TCAGAGGAGA 
8851 AATCTGGATG CTTGCAGGAG CCCCAGGGTC ATGGAGGAGG CTGGAGACAG 
8901 GGCTGTCCTG GGGTGATGGG ATGGCCCCCC CACCTGCTCA GAGCCAGCCT 
8951 GGGTGCTGGA ACCACACTTG CCTCAGGACC CTGGGCTTGC TCCTGGGGAA 
9001 AGAGTGGGGT CAGGCAAAGG GGTGGGGTTG CGCTGCAGCG AGACCCAGGC 
9051 CCATCACTCA CCATACCTTC TTCCTCCCCA TGCAGCAGCC AAGAGTTTAC 
9101 TCAACAAGAA AGCAGATGGA GTCAAGGTGA GGCTCCAGCC GGGCCCTGTG 
9151 GTGCCGGGGA GCCCAGAGCC TGCAGCTTCA CCCCCACGCC CTGGGGCTCC 
9201 TGCTCTGGAG TCCCCCTCCC CCCATGCCCT GAGAGACACG GGACAGGGAA 
9251 TGGCGAGTGA GGGGCTTCTC CCACCTAAGA GTTCCTCTTC CCTCTCTCCA 
9301 CAGCCCCAGA CGAATAGCAC CAAAAACAGT GCAGCCGCCA CCAGCCCCAA 

9351 AGGGACGCTT CCTCCTGCCG CCCTGGTACT GAGCTCCTCA AATTCTGCCT 

9401 CTCAGCCCCT CCTACGCCCC TGGCTGTGTG ATTGCCGCTG GTCAGAGGGG 
9451 GCCGGGTGAA GGTGGGGTCT GGCCCCGCCT GGCCTGTCTG ACAGCACTCG 
9501 CATGGCCCCC GCCCCTCATC CCTCACCGGT GGTGAAGTGG AGAGAAGAGG 
9551 CCACTGTTGT GGGGGGCTCC AATTCAGACA GGTTTAGGAC TGCTCTGGGG 
9601 AGCCCCTGGC TGAGACCCAC AGATGTTGGG GTGCAGGGGA GAGGCCCAGC 
9651 CTCCCACCCA TGTTGACTTG TGGATGTCTC TCCAGGAGTG TTCAGGAAGT 
9701 CAGTGAGGCA GAAGATACCC TCTCCCCACC AGGACCCCAC CCTCAGCTCC 
9751 TCCACCATCC TCAACAGGCC GACCCACAGA CCACTCCGAA GGTCTGGCTT 
9801 GGTGGGGCTG GGCCAGGATC TGCAGGGGGA ACAGCCCATA GTGGCACATT 
9851 CCACGGCCCA TGGGGAGACG GGGCCACGGT GGTGCAGTAG AGAGGTGTCT 
9901 AAGCCAGTGG CAGCCAAGGG GAGGGCTTGC CGTCACCTCT GTGTTCCCTC 
9951 AGTGCTGCTC TGTGGCTGCC TGAGAGGCAG GGCTTAGGGG CTCCCTGCCG 
10001 GGGAGGGGAG GGGTCCCCAC CATGCTCCGC TCCAACTGCG CCCCTCAGTG 
10051 CCCCTTGCCC TGGGGGCTCC TACAGGTGAA CCCTATAGCA GTACTCCCAA 
10101 GGATGTAAAG TTGTGGCTGG TGGGTGCCGG CCTTCCTGCT GGGGCGCTGT 
10151 GCTGTGTCCC CTCAGCTGTC CTAAGAGCTT TGGGGCTTGC TGGCCCGTAG 
10201 GTCCCCATAT TTGCTGGAAG CAGGCTTGGT GTCCCCTGAG AACCCCAGGC 
10251 CAGGCTTCGG GAGCCAGCCC CAGACCGCCC ACGGGAATAC TGGGTTTGCC 
10301 AAATGGCCAC CTTGAGACCC AGGAGAGGAG AGCGGTCCTG GGAGGGGCGA 
10351 GCTGCTCAGA GCAGCCAGGC CGTGGCTGGA GGGTGGCCTG GTGCAGCCTA 
10401 CCTAGGGCCT TCCAGTGGCC AGGGCAGCCC ACGTGCCAGC CTCACAGCCA 
10451 GCCCCATCTC GGACCCTGTC CATCCCATGT GCCACCGCCA CCCCCATGAC 
10501 ATCTTCAAAC CTGTGCCCCC CACCACGCTG GGGCACAGGT TCAGGCAGTA 
10551 AAGGGTAGGG AGAACCCCTC AAGACCGAGC CTGGCTTCTC TGGCTCCCAC 
10601 ACACATTGTG CAGCTTGTCG GGGCCCCACA CGGTCCATCT CCCACCCTGG 
10651 ACAGCAGCAC CTCCGCCAGC CTGGACAGAG CTCCTGTCCA TTCCATCCCT 
10701 GCCGGCTGAC CCAGGCTCCT CCCCCAGCTG CTCCACGCCG CCTCCATCCC 
10751 TGTCCCCCAC TCTGCTCTGC ACTTCTTTCT CGCAGGCTCT GGCCACCCAC 
10801 ACCTCCTCTG TCTCCCTGTT CCCCTCCTGG TGGTCTCCGC TTCCTCCTCT 
10851 TCTCACTTTC CCTCTCTTTC CTTCCTCTGT GTCTTCCTTC TTCTGTAGGA 
10901 GCCTCAAACC ACCGTCATCC ATAACCCAGT GGACGGGATT AAGGTACTGC 
10951 CCCACTTTCC TCCTCCCGCT TTCCCCAGGC AGGAGGCTCC AGGCCAGGAG 
11001 AGAGGTCTGG GGCAGCATTT GTGCCAGAGT GGAGGGCAGA TGTCCCATGG 
11051 CCCTGGCCGC CCCTCCCCGC AGTACGGTAG GGCCCCAGTC CGTCTTCGTG 
11101 GGCAACAACA GGACAGACTG GCTCAGGCCC CAGGCGCGCC CCTGGAGGTG 
11151 CTTGGCACAG TTGCGCCCGG TCCCCATGTG GCCGACACTC TCAGACCAGG 
11201 GCTCTGCGTG TCCCACCTAC GGCAGGCAGT AGGGCTTCCT GAGGTCTGGA 
11251 GCAGGGCCTG CATCTCAGGA GCTGCATCCT TGGCCCTCCT GGCTGTCCTC 

11301 CACCCCACCT CCCTCACGTG GCCCCCAGTG CTTCCTGCTG AGCAGACCCT 
11351 CCCTCCTCTG CTCCCCTCTC TGCTCTGGCC ATCAGCTCCC ATCACATTGG 

11401 CATCATCACT CTGGGGCCAG GGAAGGGGCT GGCTCTCTGG GGTGGTGGGA 
11451 GGGATGGGGC CAGCAGCCAA GCCATTTCCA GGACTTCCAA AACAGCGCCA 
11501 CTACACCCAA CACGGCCCTC CAGCCCAGCT CCCACCTAGG CCTGGGCTCC 
11551 TTACAGAGCC CCCAGAGTGC CTCTGTGGGG ACCCCCCACT TCCTTCTGGC 
11601 CAGTGCCACC ACCCAGCCCA TCATCAGAAG ACATCTTTCT CCATGGCAGG 
11651 GACCAGGGGG TCCAAGGGGC ACCCATGGTG CTAGGCACCA GGGCCTGGGC 
11701 ATTCTTCCCA TCTGGCAGCT GGGGATGGGT GCCCCTGGGA CCCGTGTGTG 
11751 TCTGGGGTGG GTCATGCTCT CTGCAGGACT CCTAAACAAC CTTCTGGGCT 
11801 GTGGTGAACT CTGAGCCTGC ACCTAAAAGA CCTGTAGTTC TGGTCTAGGG 
11851 CCTCCAAGCA GTGTCCAGGC AGTGTCCAGA CCAGGGGGCG GTCCCCCAGG 
11901 GACCTTGTAA GATGTTTCCT CTGAGGAGCA GAGCAGGCCT CCTGGGGACC 
11951 TGGGGGATGG TCTTTTGAAG GGCAGCAGCC CTGGAGCAGG GTGGGAGAGT 
12001 CTGGGGCCAC CTCTGCCCTC TAAGGCCACC TGAGAGGTGA GGCCGGGGCC 
12051 TGACTGGACG TCCAGTCCCA GAGGGGCAGG TGCCCTGAGG GAATGTGGGC 
12101 GACAGGAATG CTCTGCCTGG GGCCAGGCCA AGGTTCCTGG AGCCCTGTGC 
12151 GGATCTGCAG AGCTCCTGGG AACGCCTCAC CCTGTATTTT GGATGACACC 
12201 GGCTGCTGCT TCATTGGAAC CAGCCAGTCC CATTGTGTTT TACGTCTTGG 
12251 AATTTCAAAA AGCCCATTTT CCTCTCTTGT TAAAGAGTCA GCTGAGCATA 
12301 CCAGTCTCTC TGCCAGGCTC ATCTTGCTGG GAGAAGTGGA GCCCTCATGT 
12351 GTTGGGGATG CAGGGTGGCC ACAGCACTAG GGTGGCAGGG CCGGCCTCGG 
12401 ACTCCGTGCC AGCCTGTGCT GGCTGCCGTG AGAATGCACC CTGGTGAGGG 
12451 GCGCCCTCCC AGGGACCAGC ACAGAACTGG GTGTCTTCTC CGGTCACTGC 
12501 CGCATGAGGT CCACAGAGCT GGGGCCCTGC AGCCGCCAGA GGGCATGTCC 
12551 CCTGAGCCCC TGGCCTTTAA GCCCCGTGGA AGCAGCCGAG GCAGAGATCA 
12601 GCTTCAGAGC CTGGGCTGGT CCTGACACAG GCCCAGCCCT GTCCACCTGC 
12651 CCTCAGCCAC GTCCCACCTA TCCTTGGCCG CATCCTGACC CGCTGCCTCC 
12701 CGTGTTTCCT CAGGAGTCTT CTGACAGTGC CAATACCACC ATAGAGGATG 
12751 AAGACGCTAA AGGTACCTGC ACTTGAGTCC TTGCCCCCCC AGCGGCCTTG 
12801 GCATTGCTGG GTTGCTCTTT GAGGTGGGTG GGACTTGGGC AGGGTCAACT 
12851 CTCCTGCGAC GCCTAGTTTA TGCATGTGTT GAGGGGCTCA GGGACCCTGT 
12901 AGCTGTAATC CTGCTCCAAG CCTGGGTGTC AGGCCTGCCC AGAGCGGAGA 

12951 AGCATGGCAG AGATGACCGA CAGCTGGGCA GTCTCGGTCA CCGCATCCAA 

13001 GTGAGGAAGC CACGGCTTTG CATGGAGGCA GGTTCTCCAC ACCAGGACCC 
13051 TCACGGGGAA ACAGGCCCAT GGGTAGAATT TGTTCCAAGA TGCTGTCCTT 
13101 GTCTTAAAGC TCCTTAAGCT TGCGTTTCTG TCCAGCATGC ACTTGCCAAG 
13151 TGGCCGGGCA GCTGGGTGAG TGTTTCCGTG TTTGCCTTTG CTTAGCCAGG 
13201 AGTGTCCTGC TGCGGTGGGT TTCTGCACCA CAGATTCCAG GGCCCCCTCC 
13251 CTTGCTCACC CAGGCCAATG TCTTGTGTGT TCCCCAAGAG GCCCCCAGGG 
13301 CACCAGGCAC TGGGGCATGC TCCATGGATT CTGCCGCCTC CAGACCACCC 
13351 ACATGGGGCC TCCTGACCCT CATCGCTCAC ACGGTCACCT AATAAGCCTT 
13401 ATGCTGTTCT CAGGGCTACC CTGGTGCCCA AAAAGGGTCA GCCACTCTGC 
13451 CAGTTTAGGG GAGAAAACTT CTCACCTGTC CAAAGCATAG CCTTGCTCCT 
13501 GCCCGGCCTA CCCAGCTATG ACACTGTCCC TGAGCAGAGA TGAGCACAGG 
13551 ACTTTGGGCC CTGGATGCCG GAGAGTGGGT GTTTGTGTGA TTCCCCTGCA 
13601 GTCTGGAACA GGCCCCAAAG GCAACAGCAT GAAGGCTGTC CAGAGGTTCT 
13651 CCATCACCCT CAGCCGAGTG GGGTGCTGAG CAGTGAGGGA GGGGACCTGG 
13701 GAGGGGGGCC CAGCCTGGAT CCTGCAGGGG AGAAGAGAAG ACAGCCAGAA 
13751 GCCAGCAGCT GTGGCTCAGA TCTGAGCCCG AGCAGCCTCT CGAGGTGGAG 
13801 GCAGACACCC CCCACCCCAC CCCGTGCAGA AAGAAGCCTT GCCAGCCTGC 
13851 CCTGAGGCTG GTACAGAGTC CAGGCAGGCT CAGTGGCCAT CATGCCCCTA 
13901 CGATGACTGT CACTCCCTCT CCGTGCGCCT GGCCTCTGCT GGCTCTGGCC 

13951 AGGGGTGGTC ACAGCACTAG GGTGGCAGGG TGGCCTCTGA CTCTGCGCCA 

14001 GCCTGCACTG GCCTGTGCTG CCCTGGCCTC TGCTGGCTCT GGCTCTGGCA 
14051 CCGGTCCCGT GTTGGCTCCT TCAGCCTTCA CATACCTGCT GCGGCCACCA 
14101 CAGGCCCAGG ACCCCCACAG GGTGGCCACC CCACCTCCAC CCCAGGAGCC 
14151 CCAGGTATCC AGCTGTCACC CCCTCCCTCC CTCCTGGCCT CCCCCTGTCC 

14201 TTCTCCAGTT GCCTTCTTTT CCTGCGGGCG CACCACCCAC CTGCCTGCCT 
14251 CACCTGTTCC GCCTCAGCCC CCAGGGTCCC CGACATCCTG AGCTCAGTGA 
14301 GGAGGGGCTC GGGAGCCCCA GAAGCCGAGG GGCCCCTGCC CTGCCCATCT 

14351 CCGGCTCCCT TTAGCCCCCT GCCAGCCCCA TGTAAGTAGC CTGGGTCCTG 

14401 CTGCTGTGGG GGTCATGTTG GAGGGCTGGC AACCCCCTAG AGGGGCCACT 
14451 CCAGAGCCGA GGGCAGGCTG AGCGTGGACC CTGGCTCCAG CCTCATCACC 
14501 CCACAATCCC TCACTGGGGC TTTCCAGGGT GGCCCCAGCC CATCGAGCCC 
14551 CACCTCTTTG TGAGGAGGGC CCTGGACCAC TTTCCTGCTC AAGGCCACTG 
14601 GGCAGGATGG GAGGCCCTGG AGGCTCGGGC CTCAATTCCA GTCTTCAGGG 
14651 TCGGTGCAGG CCTCACTCCA CCTCAGCTTG CGGGCGGGGG GGCTCCCTGC 
14701 TATTGAGGCA GGCTCTGATT CAGGGCCTGA TCCCAGGGCC CAAGGGGTCT 
14751 AGAACACGGG ACCCCTCCCA CTGGCCTCCT CCGCCTTGCC GCCGCCTCGT 
14801 GTGTCTGTCT GCCTCATGTT CACGTCTCAT CTGTTCCACC CCAGCCCCCA 
14851 GGGATCTCTG ACATCCTGAA CTCTGTGAGA AGGGGTTCAG GAACCCCAGA 
14901 AGCCGAGGGC CCCCTCTCAG CGGGGCCCCC GCCCTGCCTG TCTCCGGCTC 
14951 TCCTAGGCCC CCTGTCCTCC CCGTGTAAGT AGTGGCCCCC AGGCCTGCCG 
15001 CCTCTGCTGC CGGACAGCTC CCTGCGAATG GCCGGCGCTC AGCAGCTTCC 
15051 CACCTGCATG CACGGCCCAG CTACCCTGCC CCGGCGCCGC AGCCTGGAGT 
15101 CCTGCCCTGG CGGGGCTTCC TGTGGGCTCC CATGCTAACC AGCAGGGCAG 
15151 CTCCTGGCTT CTCCCTAAGG GGCCCAGACC CCTCCACGGC TCCTGCTCCC 
15201 ACTGCCACTC CCCGCTCGCT GTCCAGCCCC AGGCCCCTCT CCAAAATGTC 
15251 TGTCCCAGCC CTGGGCAGCC CTGGCCCCTC CGAGGCCCCC CATGCCCCTA 
15301 GGCCCTCTCT GCTGATCACT GTCCCAGCCC CACAGACTTC ACACCCACCC 
15351 AGGGGCCCTG CCCATGGTGC CCAGGAGCTG CACTCAGGGC CACCCTGGTT 
15401 CCTGATGTGG CCCCAACCCC TGAGCACCCT CCCTCAGTCT AGGAGGCTGA 
15451 GGAAGGTGCC AAAACTGGAA CCCCGACCAG GGTCTCTGGA GCTCACCAAC 
15501 AAGGGGATAG TACGGAGAAT CATAAGCCTG GCCTGTGCTG ACCTGGGCTG 
15551 TCCTCATGGG GCCAGGCCAG GCCTCCTCTG TAACGCCCGT GACTCCCTCC 
15601 TCTCCCTGTA ACCCCGTCCA GCGTTCCTCA AGGGCCACTT ACCTGACAGC 
15651 TTCTTGCTGG CCAGCAGCCT CTCCCTGGAG GGTGCCCTCT GCCCCCAGCA 
15701 GCTTCAGCCC ACGCCACCCG ACAGCCAGAG CATCTGCCCT TCACTCCTGC 
15751 AGCCTCCTCT CCACGCACCA CGCTGTCCGC AGCAGCACCC TCTGTCCCCC 
15801 TGTCTCCCTC CGTCCCCCCA TATCCCCCTC GGTCAGCCTA CAACCTCTCC 
15851 ACGTCCCCCT AAGTCCACGC TCTATCCCTA CATCCCCCTC TGTCCCCCAA 
15901 ATTCCCCTCT TTCCCTCATT TCCATTTTCC TCCCCAAACT CTGCTCTGCC 
15951 CCTCACATTC TCCCTCTGTC CCCCACACCC TCCTCTGTCC CCCACACCCT 
16001 CCTGTGTCCC CCACACCCTC CTCTGTCCCC CATATACCCC TCTGTCCCCC 
16051 ACACCCACCT TGGTCCCTTG CACGCCCTTT TCTGTCCCCC ACACCCCCTC 
16101 TGTTCCCTAC ACTCTCCCTC TGTCCTCCAG ACCCTCCTCT GTCCCCCACA 
16151 CTCCCTCTGT CCCCCACACC CCCTGTCCCC CACACTCTCC CTCTGCCCCC 
16201 CAGACCCTCC TCTGTCCCCT ACACTCCCTC TGTCCCCCAT ATCCCCCTCT 
16251 GTCCCCCACA CCCTCCTCTG TCCTCCACCC CCTGCCCCCC ATACCCCCTT 

16301 CTGTCCCCCA CACTTCCTCT GTCTTCCACA CCCCCTCCTG TCCCCCACAC 
16351 CCCCTCTGTC CCCCAGACTC TCCCTCTGTC CCCCACACTC CGTCTGTCCC 

16401 CCACACCTCC TGTCTTCCAC ACCCCCTTCT GTCCCCCACA CCCCCTCTGT 
16451 CCCCCATACT CTCCTCTGTC CCCCACCTCC CCTCTGTTCC CCACACCGCT 
16501 TCTGTCCCCC ACACCCCCTC TGTCTTCCAC TTCCCCTCTG TCCCCCACAT 
16551 CCCCCTCTGT CCCCTGCACC CTCCTCTGTC CCCTGCACCC TCCTCTGTCC 
16601 CATGCACCTC TCTCTGTCCC CCACATCCCC CTCTGTCCTC CACACTCCCT 
16651 CTGTCCCCCA CATCCACCTT GGTCCCCTCA CGCACCCCCA TCCCCCATGA 
16701 CCCCTTCTGT CCCCCACACC CCCTCTGTCT TCCACACCCC CCTCTGTCCC 
16751 CCACACCCAC CTTGGTCCCC TCATGCCCCC CATCCCCTAC ACCCCCACTT 
16801 TGTCCCCCCA CATGCCCCTC TGTCCCCCAC GTTCCCTTCT GTCTCCCACG 
16851 TCTCCTCCAT TTCCCGTTTC CCTCTCTGTC CCCCAAGCTC CCCTCCATCC 
16901 CCCACATCCC CTTCTTTCCC CTATATCCCC TCTGTCGGCC CAGGTCCACC 
16951 ATCTTCCCCC CACACCCCCC CATTCTCCCT TCCTCCCCTC TGTCCCCTTG 
17001 TGCCCCATCC CCCACATCTG CCTCTGTGCC CCTCAATCTC TGGCTTGGCT 
17051 GTCTGCCCAT GGTTTCTCTC CTGCGTGCCC CCCGTGCCTG CCTTGTGTTC 
17101 ACGTCTCGTC TGTTCCGCCC CAGCCCCCAG GATCTCTGAC ATCCTGAACT 
17151 CTGTGAGGAG GGGCTCAGGG ACCCCAGAAG CCGAGGGCCC CTCGCCAGTG 

17201 GGGCCCCCGC CCTGCCCATC TCCGACTATC CCTGGCCCCC TGCCCACCCC 
17251 ATGTAAGTAG CACCTTGAGT GGCCGTGGCA GCGGCTGCCT GGAGGGGCTC 

17301 GGGGCGTGCG AGCCTGGCAG TGGTGCTCTG GGAAGGGCCA TTCTTGCGGA 
17351 GGAGGGCGGG GCACAGGATC CCTCTGCTGG GTCCCAGGGA ATTGCTTTGA 
17401 AGCACATGAA GGTGCCACTG GGTCTCAGAA AATGGAGGTT ATGGTTATGA 
17451 AGTGTGTATG ACATATGTGT ATAGGAAGAG CGTCCGAAAG AGCAGGTTTG 
17501 TTGCCGACCC CAGCATTCGC AACCCTGAGG TCCACAGCTT TCTCCTGATG 
17551 GGAGGGGAAT GGGTGGCAAA GGGTCTGCGC GTGTGGCAAG GGCTAGCACG 
17601 CCAGGAGCTG CTGGCTTGGG TCAAGGTGGA CCTGCTGGGC CGGGACAGAA 
17651 AAGTGTCAGT CCCGGCCTGA GACGCTCTAG CATTAGAGCT GTCCAAGTCC 
17701 AGACAGCAGG GAGCAGGTGG GGATCGGGAG GCGCGGATCT GGGGGGCAGC 
17751 TGGGGCCAGG CTGAAACAGA GCGGGCGGGA CAGGAAGCAC AGGCTGGGCA 
17801 GCCTCCCCGG CCAGGGAGGA GCCAGGCTGG GCCACCTCCC GGTCTGTCTG 
17851 CCGACTACCC GCAGTATCAC TTACAGGGAT GGATGACATC CCAGGGCTGC 
17901 TGCCACCCCC ACCTGTGGGG AGACACCAGA CTGGGGGTGG TGTGGAGATA 
17951 CTCTTAGAGA AGAGGCTGCT GGGCCACGGG CTCGGCATGG CAGGGCAGTG 
18001 GCTAGGTAAG TACTTGAGGG ACAGGTGGGG TCTGCTTGCC ACCGTCCCCT 
18051 CTGCAGGCTG GGCCTGGGGG CTGCTGCAGG CGGCCAGGGC AGAAGGGTGT 
18101 GGGGAGAGTG AACCCACAGG AGCAGCGGCT CGAGGAGGGG GATGCAGGCT 
18151 GCAGGCTCAA AGGGGCACTG GATCCACCCT GGGTGCCCGA GAGAGCAGGG 
18201 GGCAGCCCCT GGAGGGGTAC TCACCCCCAG AGCTTCTGTG GTCGGCTGAG 
18251 GACCCCCAGC AGGGGTTGAC TGAGGGGATC AGAGGCAAGC AGCTGAGGGG 
18301 AGAGGCCAGG TTCTTGATGC TGATAGGGTC GGGGTGCCTG GGCGACCAGA 
18351 ACTCAAGGAG GGAGGCATGG GGAGGGGCCG CCGTGCAGCT GGGGTGGGTG 
18401 CACCGCAGAG CCTCTGGGAG TGGTCAGAAC CCCCGACACC TGCCACTTCT 
18451 ACAGCAGCTC ATCTGATTTT AAGGGGCTTG CTGCCCTTGC AGAAGTGGAG 
18501 GGGTGTGCCC AAAGGAGCCT GCCTGGAAGG TCACCCCATC AGGTTGGCAT 
18551 GACCCCAGCC CAGGACTGCA GCCTGCCCTC AAGGTCTGTG CAGTATCTGG 
18601 GGTGAGTCCT CTGAGGACAG GGCCCAGGGT GGGTGTGGAG TGGCCAGCTC 
18651 GGGGCTCGGT GTCCAGGCTC ACCTTCAGGG GCCACAGCAC AGACCTGCCC 
18701 TTCCAGAGTC TTCCCTGAGC TTGGCTGGGG AGGAGGGGGC TGCAGGAAGG 
18751 AGCTGTGAGC AGGGCAGGAT GGAGATTCGT GTGGCCCTCC TGGGAGGGGC 
18801 TGGGCAGGGC TGGGAAAGGG GTGGGTGAGA TGTTCCGGAA CTCAGGGAAA 
18851 GGAAGAGTCT GGGTACTGCC CTGGGGGCAC CTGGGCCCAG GTGGCAGGTG 
18901 GCCAGCTTTC TGCCTCCTTT CCACCTCCTT TCTCCAGAAG GCACCCACCA
18951 GCTGTGTAAA TAGGGCAGGT GCCCACGGCC CGCCTCAGGC CCCGTCTCCT 
19001 CCCCACCCAC GCTCTCTAAT CGCGGATTAT ACACAATCCA GCCTGATCCC 
19051 TGGGCAGCTG CCCTCCCTCC CGCAGCCACC TCTGGCTCTG AGAGATGGGC 
19101 TTGGGGCCAG CCTGGGGTCC CAGGAGTCCA GGCCAGGATG AGAACCTGCT 
19151 CTGACCCCAC CTGGACGCAT TAGGCCTGCC TGGACCTGTT GCCTCACCCC 
19201 AAGAGAGCCA CAGGCAATGC AAAGGCTCCT GTTCATGTCA GGGCACCTGG 
19251 AAGGCCTGAC TTGCAGAGGC TCTTGGCTCG TGCAGACCCC TCCAAGCCCA 
19301 GGCCCTGCCC ACCACCTCCC CTTTGTCTCT GGAACTGCCA GGACAGCTTG 
19351 TCCTCAGCCA GCAGGTTTCC CGACCCGGGC ACCTCTTCAT GTTGGGCCCC 
19401 CCTCCTTTCC CTCCATCAGG GATCATGCCC TTCTTCAGGG GCCTGGATAT 
19451 CAAGGACACA AAAGCTCCCA TGTGCTATGT GGGGAGGCAG AGTGGGGGCT 
19501 GGGTTGAGCT GGGGTCTGGG CAGCGCCATT CCGCAGGGCA GGGGCAGCCT 
19551 AGGCTTCCCA TCTGTGGAAT GGGTGGGTGG GTCTCACAAC GGACCTGCTT 
19601 CCCGTACTTC AGCACGGTTA CCACTCTTGA TTGGAACTCT GACCATGCAT 
19651 CTCCTCTTCT GTTTACTTCA CGCTTTCTCT TCCCATCAAC TCCCATTTTA 
19701 ATTACAATTT GTTTAAAAGC ACTGCATATT ACTTCATTAA ACAGAAGATT 
19751 AGTTTCACTT ACCATTAGTG TAAGGTGACT ATAGAACCAA AGCAGACTGG 
19801 AAACCAAATG ACATAATGTC ATTCTCTTCT CCATTCCAGC TGCCTGCTGC 
19851 TGTGCGCCTG AGAACCCCTG TGGAGTGGGA GGGGCAGCTG TCTCTGTACA 
19901 TTAGAAAGGG AGGTTAACTA AGTGACAGGA GGTGTTTGGG ACATGTGGAC 
19951 ACCAGACTTC TCTCTTGATG CAAGGAGGGC AGAGCCAGGC AGCCTAGTGG 
20001 GGGCTGGCTT GGGGGCTGCT GGAAGGACTG GCTACAGGTG GAAGAGAGGT 
20051 CAGACCTGAA GCTTGGGGCC ACCTCCAGGA AAGGACAGGT GAAAGTGGAG 
20101 GCATGAGGCA GGGGAGAGGC AGGTGCCAGG CAGAGGGTGG AGAGGAGGCA
20151 GGAACATAGC AGCTGGGGCG GGGGCGGGCC CTCAAGTGTC ATATGCTACT
20201 TTCCTGGGGC CCAGGGGCAA GGACAGGAAC AGCCACAGCA TGTGTTGGGA 
20251 CAGAGCCCTG TGCCTTCCTA GAGCTGGGCA GGTGGAATGG GGCAGGAATG 
20301 GGACTCGTGG TGGCTGCAGC AGGAACTGGA GGGGAAGGGG CTTCTGGATC
20351 CTGCAGCCTA CCTTCCTAGA GGCCAGCTTT CCGGGGTCCA CCAGGTGGGT
20401 GGGAACTGGG CTTGTGTAGC AAGACTGCCC TGAGGACCAT CCATGACATG 
20451 GTCTAGATGA AAGTTAGGAA AGAAAGGGAG ACAAGCTGGC AGCAGAAGTA 
20501 CAGCTGGGTC AGGAGCAAGG GCCTTTCCAG ATAGGGACAA CCCAAGAGTG 
20551 CACATGTGCC CACGCCACAC AACACAGGCA CACACGACAC GTGCACGCTC 
20601 ATAGGCACTG CACACACACA TGCACAGGTG CTCATGCATA TGTATGAGCT 
20651 TCATCTACAC ACATTCACAT GCCGTCCTGC TTATGTGCAT GTTTCCATAC
20701 ATGCACATGA ATGCACAATC ACGTGTACAC ACATGCATGT GATCACATAC 
20751 ATGAACATGT GTGCACCCCA CTCCTCAGGT GCCATCGGGC TCCTCCTGCT 
20801 GTCACTGTGC AGCAGGGGAC ATGAGGCCCC AGAGCAGACA GGTGCAGCAC
20851 AGGCGTTCCC AGGCAGTGCC CCACACACAT GCATGAGCAC ACCCGGGCAT 
20901 GTGGCGCCTC CTTTGTGGAC TCAGTCCACC TGCCAGGTGG GCTCCCTGGT 
20951 GGTGTGAGCT CCCAGAGGTC TGGCGAGAGA GATAAAGGCA ACCCCACCAC 
21001 CAGGCGTGCT GAGAATTCCC TCTTCTGGCT GGGCACAGTG GCTCATACCT 
21051 GTAATCCCAG CACTTTGGGA GGCCGAGGTG GGCAGATCAC TTGAGGTTAG 
21101 GAGTTTGAGA CCAGCCTGGC CAATATGGTG AAACCTCATC TCCACTAAAA 
21151 ATATACACAC ACAAAAATTA GCTGGGTGTG GTGGTGTGCA CCTGTAGTTC 
21201 CAGCTACTCG GGAGGCTGAG GCAGGAGAAT CGCTTGAACC TGGGAGTCAG 

21251 AGACTGCAGT GAGCCGAGAT CATGTCACTG CACTCCAGCC CGGGTGACAG

21301 AGTGAGACTC CATCTAAAAA AAAAAAAGAA TTCCCTCCTC TGGGAATTTA
21351 GACCACAGAC AGGTTGCATG TATGTGGCCG TTGGAGGCAG CACTCACAGC 

21401 AAAGAGTGGA AACGTCACCA CAGGGCCTGC CTTCTGGTGA AAATGGTGTC
21451 CTGCAGGGCG GGCAGCTGTT TGAGGGCAGG TGTCCCAGGT GCGGCCTGCA
21501 GCAGCCTGAG GGTCACAGAG CGCAGTGCTG GGAGTGCAGA GACTTCCCCC 
21551 ACAGGGAGAG TTCCCAGGAA CCTGCTTCCG GTGCACTTCT GGGGGTTTGA 
21601 GTTTTTTCCA CGGACGAATT ACTTTGAGAA ACCACTGTTA CTCGTGTGTA 
21651 TAGGTGAGCG TGCGTGTGCA TGTGTGTTCT GTGTGTGAGT GTGCATGTAT 
21701 GTGCGTGCCT GCGTATATAT CCTCGCAGAT ACGGCTAGGG ACCTCACTCA 
21751 GGACAGTAGT TCTGCCTGAG GAGAGTGAAT GCGGCAAGAT TGAGGAGAAC 
21801 ACAGGCATCT TCAAACTACA TGTGCGGTGC TTTATTTCTT TAAAAATGCG 
21851 TCTAAAGCAA ATAGGAAAAT GTTAAGATTT GAATCCGTAG AGTGTGGGTT 
21901 CTATTATTCT CTCCACATCT TCCATACGTT TAAAATCTTT TGCAATGAAA 
21951 ATAAGCTGTA GTTAAAGCAG CAATGCAGGC TGCCAGTGAG CGCCCCGGAG 
22001 GCCAGTGAGG ACCAGCATGG CTGGGTGGCC TGTTGGAATC CAAGGGGGGC 
22051 GGGCAGGAGC TGCAGGCAGG CGCCCGGGAG TAGCCCGGGC ATGGGGGTGC
22101 GGGGCAACAG GGATGTCTGC AGGGGTAGCA TGTGGGCCCC GGACTGCAAG 
22151 CAGGTGGAGC CAGCCGGATG CGGCTCCTAT GAGAAAAGCG GGGAACAAGA 
22201 GACCACGCTC GTTCTTCCTG CTGCGGGGAC AGCCCTGGTC ATCGCTCCGG 
22251 GGAACCCTGC AGCCTGCGCC GCACGTGGCC GCCCCCTGCT GCTTCCTCCT 
22301 CCCCGGCCTC CGGGTGGCCT TGCTGACGGC TCCTTCTCTG AGGCAGGTCT 
22351 CTGCCTTCTC GCCTGGTGCC TGCACTCAGT AGCCCCCTCA CCAGAGCTGC 
22401 TGGGTGAAGG AAGCACTAAG AACCCAAGGC TCGGGAGGAG AGTGGGGCCG 
22451 GGAAGCTGCA GGGAAGCGCA GGGCCAGGCC TGGTGGGCCC AGGGGCTGGC 
22501 TCACGGGAGG GCAGGAGGGA GACTGTGGCG GACAGCACGT GGGGCCAGGA 
22551 GGTGACCTCC AAGTGGATTG TGGGTGGGTT TTTTGTCCTC TTTCTGCATT 
22601 TTCCAGGCAT TTTGTAATGT GGATAGAATA TTTCTGTTCT TCAAAAATAC 
22651 TTTAGTTAAG AAAAATAAGA TGGAAGCTGT TGCACTTGAA AATGAGGAAG 
22701 CCACTGGTGA TGCAGGGGGG GCGGCGGAGA GGACCTCTTC TGCAAATAGC 
22751 GGCAGGAACA CGGCATGGAT GCAGCTCGCG CTCCCCCAGG CCCTCCCCTG 
22801 GGCTGTGTGG AGGGGTCCGG GGGGAATGGG CCAGCGCCCA GTGGTCACCT 
22851 GGCCATGTCT CCCCACAGCC CGGAAGCAGG AGATCATTAA GACCACGGAG 
22901 CAGCTCATCG AGGCCGTCAA CAACGGTGAC TTTGAGGCCT ACGCGTGAGT 

22951 CCCTGGGGCT GGGGGGGGGC TGTGCAGGAC AAGGATGTGG GACCCTTGGG

23001 GGGGCCTGCT CAGAGTCAGG GGTCCACGGG GCCCCTCCTC ACTTGGATTT
23051 GGCCCCCAGG AAAATCTGTG ACCCAGGGCT GACCTCGTTT GAGCCTGAAG 
23101 CACTGGGCAA CCTGGTTGAA GGGATGGACT TCCACAGATT CTACTTCGAG 
23151 AACCGTGAGT GAGGAAGCCC GGGTGGGCAT GAGGGGGCGG TGCCCCCAGG 
23201 AGAGCCTCTC GGCCCCTCCC AGGGACAGCA TGGTGGCTGC CTATGGAAGC
23251 CCTGTCCCCT CTGTGCCCAG GGTTGGCCAG CCACCTCTCC CCCGCCAGAG 
23301 GCCATACCCA GCCCCCAGAA TCCCACTCTT GGAGGGGCCC ATGCTGCTCC

23351 CAGGAGAGCC GAGCCTCCCC AATAAGGGGA GTTGAGAGAG GGAAAGGATT
23401 AGGCTGGTGG GGTGGAAGAC GGGCACCAGG GCAGTCATGG TAACCCGAGA 

23451 CCCCCGCCCC GCCTGCTGTC CACAGTGCTG GCCAAGAACA GCAAGCCGAT
23501 CCACACGACC ATCCTGAACC CACACGTGCA CGTCATTGGA GAGGATGCCG 
23551 CCTGCATCGC TTACATCCGG CTCACGCAGT ACATTGACGG GCAGGGCCGG 
23601 CCCCGCACCA GCCAGTCTGA GGAGACCCGC GTGTGGCACC GCCGCGACGG
23651 CAAGTGGCAG AACGTGCACT TCCACTGCTC GGGCGCGCCT GTGGCCCCGC 
23701 TGCAGTGAAG GTGAGTGTTC TGTGCTAAGT GACAGCTGGG GCAGAGGGGT
23751 GGCGGTGGTG TGAGTGGCTG CAGCCTGGGG AGGCGATGGG GAGCGGTGGG 
23801 GCCTGTGGCA GAGCCCATGC CTGGGAAGTC CCTGAGCTTT CCTGGTGAGG
23851 CCACAGGAAT GATGTCAAAT TAGGGACCAC GGCAGGCTGG GTGTGGCAGG
23901 CCTCCCCAGA GGACTGGGGA GCTGGTGAGG GCCTGAGCAG TCCACACTGG

23951 CCAGAGCTGG GTGGGTTGCA GGTGGATGGG CCCCGGGCAG CACAGTCCTG

24001 GGCACCATGC CCTGTTTGTG AGGACTGTTA GAGCCCCAGA TGGGCGTTCC
24051 CCAGGTGGTG GGTGCAGCGG GCCCAGAGCC CAGTTTTACA GGGATAGTAG
24101 TAATTGGGTT GGGCACCTTG AACCTCTCTC CCGAGTGGGC CCTTTTCTGG 
24151 ACTTTAACCC TCTCTGCAGT GCCGCATGGC AGACAGCAGA GCCTGGGGGT 
24201 GGATGGGAGA GGGGGCTGCT GAGGAGCTGA CCCACCCGCC CCATTTCAGA
24251 GCTGCGCCCT GGTTTCGCCG GACAGAGTTG GTGTTTGGAG CCCGACTGCC 
24301 CTCGGGCACA CGGCCTGCCT GTCGCATGTT TGTGTCTGCC TCGTTCCCTC

24351 CCCTGGTGCC TGTGTCTGCA GAAAAACAAG ACCAGATGTG ATTTGTTAAA

24401 AAAAAAAAAA AAAAAAAAAA AAAAAACAAG ATGACGACGA CAACCACAAA
24451 AAAAATTGAC ATCAGATGAA ATGAAAAAAA AAAAAAACAA AAAAAACTAA 
24501 AGGAAGGAAA AAGCTGTAAA AATCACTGGC ATTCGTGGGG CCACTCCCCA
24551 CCCAAGCTCC ACGTGTGTCC GTCTGTGCTC CTGGCCTCTG GGGGACCAGC 
24601 TGGGACATGA ACTTGTCTGC CAGGCCCCCG TCGCGTGCTG AACGGTGTTA
24651 GTTTGTAGGT AACGCACACA CCCCACACCT AAGGTGTCTG CATCCTCCTG 
24701 CCAACGCATG GGCTCCACGT GGTGTGCTCG CTGGCTGTCG TGACTGTCAG
24751 CTGTCTCTTG GGAGGGGCTG TGGGGGCCCG CTGGGCTGCC TCCTTTCCCG 
24801 CTAGTTGTGC CTGAGAGTTG CTGTTGTTCC TGCTTTCCCT TCCCTTCCTT
24851 TCATCCCCTG AAGGGCTAGG TGTGGGTTTT CCGTGCCCGG TATCCCCACA 
24901 CACCCAGCAC GGACAACCCT TCGGCAGAGC CCAGGCCGGC CCCTCACCCC
24951 CTGGAGTATT GAAACTGGAG TCCCGTCCCC AAGGCCTTCA GAGATGCCCC 
25001 TACACACCCA GGGCTCCAGC TCTGGTCCTT CTGGGGGAGT AAAGTGCAAA 
25051 GAGGGGCACA GCTTAGTTTT GGGCCTCTCG CCGAGCAAGA GACAGCACTG 
25101 CTGGCTACAG CTCCAACACA GCCAGCTGTG GCAAGAGGAC TCTGCCTGGG 
25151 CTGGCCCCCC TCCTGTGTGA GGTGTCTGTC CCTTCTCTGC TGGCCAGCAG 
25201 CAGATGCACT GGCAGCTCCC AACCCTGTTT CCGCCCCTCG GCCCTCCCCC
25251 AGCCTGTTCG GCTTCTCTGC AGCCCGCAAG GGGGAGCAGA CTTTTGACAA 
25301 AGGACTGCGG GCCTCGCTCA AGTCCCTGAG CCCCCAGCTG AAGCTGGGAG 
25351 GGGAGGCCAG GCTTTGTGTC TGGGCATATT CGTCTGCTGA TGGGGTTTGG 
25401 GGAAGCCTGG GGCTTGGGGT TTGGTCGGGT GGTGCAGCTA GTGGCAGAGC 
25451 GGGATCAGAG GTGGTGGCTG CCCAGCTTCT GGGCTGAGAC AAGGGTCTGT
25501 GCAGGGGTTT ACTGAAGTGG GAGTGCCTTT GGAATCTGGG CCGGGAGCAG 
25551 AAGGGAGCAA AAGCTACAGT GGGAGCCAGC CTAGGGCACA TGGGAGGCGT 
25601 GAGGGCAGTG CTGCCCGTGC AGTGTCAGGT GTGCCAGTGC CTTGGCGGGC 
25651 TGCAGTGCGT GTGAGGGCAC CTTCTAGGTG GGCCAGGGAT GCAGCTATGG 
25701 AGATAAGGCG GGCTGGGGAC AGAAACAGGT GGGCACAGGG CCCAGGACAC 
25751 CAGCGGATGG AGGGCAGGGT CTAGCCCTGT GCTCCTGAGC GTCGGCTGCC 
25801 TGGGTTCGAG GCGGTGGGTC CCCGGCCCCT TGTGATGGTG TGTACCATGG 
25851 GGGAGCTCGG GGACAGGGCA AGCCCGAGCA TGGTGGGGCT GCAGGGTGGG 
25901 TCTGAAGCCA GGTTGGGTGG GGGTGGTCAC AAGCCCTGAC TGCAGAGGGT 
25951 CAGGGGCTCC TGCCCCAGTG CCTGCCCACT TTCAATTCAC ATTGTTTTCA 
26001 ACAAGGATTT TCTTTATCTT CCCCTACAAA TCAAGCCAAG GGAGGGGCAC 
26051 AGAATGGGGA ACAGGACACA GGATCCTAAA CTCCAAGGGG ACTGTCCACC 
26101 GATGAACACT CAGAGTGGAC ACCATCTTCC GTCCACGCTG TGCCCAGGAC 
26151 AGCTGTCCCC ATCCATGAAC ACAGGGTAAA CATCTGCCGG GCTCCGCACC 
26201 AGTGGCTCCC TGGGCCATGG GACAGCGGCA GGGCTCACCA CGGACAGCAC 
26251 GTGGCCCAGC AGCCGGCCAC CCTGGCGTCC TGGGGCCTCC TCCCCTCCTC 
26301 TCCCTCTCAC CTTGTCACCT CCACGGAGCT GCCTGTCTGG GATAATTTGG 
26351 GGATTTTTTT TCTGGGGGAT AATTCTTTTG CATGACCCCT AAAGAGCAAG 
26401 CCACACCGGT CTGCTAGCTA GGTGTCCGCG GTGTGGTGGT GGCGGCCGCT 
26451 GGCCAGCGCT GCAAGGGGTC GGCTGCCCAC GGTGCTGGCT GGCCTCCCCT 
26501 CCTCTCTCTT TTTGCTGAGT TTCATTGTCT TTTCTTTCTG AGCCTTGTAA 
26551 GTGTACAAAA ATTATTCTTA TTTTGTTCTG TCTCGGGAAA CTGCAAATAA 
26601 AAGAAAAACA GGACAAACTG CTTCAAGTGC AGCTGGGTGC TTTAGCTGGA 
26651 ATCCTGCCGA CCTCCTGCGC CAAAATACAG ACTCAAGCCC GGTCCCTGGC 
26701 CAAGACCCTA CTTGGGCCCC TCCTCCAATG AAAGGTAGTG CTATGGGAGC 
26751 CCTGAGCTGG CCCTGACAGT CCTGAGCCCC TCTAGGGTGA ACGGCTCACC 
26801 CCAGGTAGGG CACTAGTCAT AGATCATAGC TCTACCAGCT GTCTCCACCT
26851 CTTCCTCTGG TCCTCTGAAG TCTTCTGGGC CCAGCGCTGT CCACCCTGAA 
26901 TGCTGGAACT GAAACTGGAT CCCAGCCCCC AACACCCCTG ACCTCTCCAT
26951 TCACCCCCGG TGGCCGCTAA GGATGTGGCC AGGGCAGCCT CTGGGCAGGA
27001 AGGAGCCCCA GGACCAAGAC CTCTGGCTGT CCTGCTGTTT CCTTCCGCCC 
27051 CTGCTACATG TATTGGCTAT TCTGGATGCT GAGGACACAC AGTGACCACA 
27101 GAGCCGGGCT CCACCCCAGT GGATTATGCA GACAGATGGC ACGCAGGCCT 
27151 GTGTGGACAT CAGCCTCGGG CACCAGACAT AGGCAAGGCG CAAGGTGATA 
27201 CAGTAGGCAG CCACCATGGG GGCCAGGAGG CTCCAGCAGA GGCCACACAA 
27251 CCAGCCCAGA ATCCAGGACA GAGAGCTGGA ATGGAGACAG GGAAGCCAGA 
27301 TACCAGGCCA GACTGGCCAG GTGCTACAGG CCTGTGGGCC AGGCCAGGCT
27351 TGGGGACTTC GTCCTGGGTG TGAAGGAGAC AGGCACCCCT GAGGCCTTCC 
27401 CTCTGCATCT CCAGCCCAAG CTAAGCGCAA ACTCTTAGGT TGGAGTAAGG 
27451 AGTAACCCCC TGCCAAGTTT CTCCTGTCCT CAGGCTCCAC CCACCACCTA 
27501 TGCTGCCTGG CCCCATGGGG CACACGCTCA GGCCCAGCCT GGGAAAGCAA 
27551 CTGCACCTGC CTGTGCTATG CTGGCCCTTC TCAGCCTCAA TGCCCTCCTC 
27601 CCTCCCCGAC GCACCCTCGT GGCCCCCGCT GGGCCCCCTG ATGCACCCTC 
27651 ATGTCTCCAT GGCAACCTGC TCAGAGTGTG GCCCTGCCCT TGGCTCCCCT
27701 CCACACCTGT GTCCCAGGCA GTGCCACGGC ACTTTCCTAA ACAGAAGGAT 
27751 GGGCTTCAAA ACAGTCCCAG ACACTAAACA CACCTGCATT TTGGGTCCAA 
27801 GTAACTTCTG ACAAGACGAG TGCCCCTACA CACCCTCAGT CCTATCCACT 
27851 ATGGGCAAGG AGCCTGAAGG ATCCCCCAGA ACTGGCTAAA GCCCTCAGTC 
27901 TCCTCCTCCA CCCTGAGCAC CTTCACGCGG CAGAGTGGCC CTGGATGTCA 
27951 GCTTCTTGCT CCCCATGGTC TGCACCTGGA CAGGTGCTCT CAGGTGTGTG 
28001 GGTGGGCAGG TGGCAGGTCC CAAGAGCCAG GTGCAAAGAA TCTAGGCCAG
28051 TGCCCACGAG TGCTGCAGTG TCTGTCCCCA GCATGGTATC TAGGGCTCCA 
28101 CTTGCCTATC AGCTGTAATC GGAGGAGGCT TTCCAGGCCA GGCCTCCCCC 
28151 AGGAAGGCTG CAGGCACTGC GGATCGTGCG CCCTCACATG CATTATTCCT 
28201 GAGGCCCTTC TGCAGATGCC ATCAGGGCAG CAACTCTGAT GAGGTATTAG 
28251 GGCACAGCAC ACAGGGCTAA GCCACCCTGT ACTGGGCCAA GCGCTACAGG 
28301 CAAAAAGGAC ACCACCGACG GGCATTTCAT TCATCGCTTT TATTTTTATA 
28351 TATTTTTGAG AGGGAGCCTC ACTCTGTCGC CCAGGCTGGA GTGCAGTGGC
28401 GCGATCTTGG CTCACTGCAA CTTCTCCCTC CTGGGTTC (SEQ ID NO: 3)

CACCTCTGGGTTTAAACAACATGCACCCTTGTGCCGGTCACCTCCCTGCAGCCGGAGAAC
CTGCTTCTGGCCAGCAAGTGCAAAGGGGCTGCAGTGAAGCTGGCAGACTTCGGCCTAGCT 
ATCGAGGTGCAGGGGGACCAGCAGGCATGGTTTGGTGAGTGCCAGGGGCAGGGTGTGTTG 
GCTGGCAGTTGGCAGGGCAGGAGGTGATGCTGACAGCCCCTTGTGGCCTCTTCCCCTCTC 
TCTAGGTTTCGCTGGCACACCAGGCTACCTGTCCCCTGAGGTCCTTCGCAAAGAGGCGTA 

[T,C] 

GGCAAGCCTGTGGACATCTGGGCATGTGGTGAGGCCTGGCCTGAGTTGGTGCGGGGCAGG 
GCCTCGGGTGTTTCAGGACTTCCCACCTACATCCTGGAGTGTGCAGTGGCCAGCACGTCT 
TGCTCTCATCTGGGTTTATCTGTGTCAGACCTGCCCTTGAGCTGCCCTGGCAGGGGTCTG 
CCCACACAGCCAAGAGCCCCCTTTCCACCCAGATTAGAATTGCTCACATGAACCTGGCGC 
ACCCCAGTGCTCGCCTGCGCTCAGCAGAGGTCTGGTCCAGAAGTGTGGTGGGTGGATGGG 

GTTTAAACAACATGCACCCTTGTGCCGGTCACCTCCCTGCAGCCGGAGAACCTGCTTCTG 
GCCAGCAAGTGCAAAGGGGCTGCAGTGAAGCTGGCAGACTTCGGCCTAGCTATCGAGGTG 
CAGGGGGACCAGCAGGCATGGTTTGGTGAGTGCCAGGGGCAGGGTGTGTTGGCTGGCAGT 
TGGCAGGGCAGGAGGTGATGCTGACAGCCCCTTGTGGCCTCTTCCCCTCTCTCTAGGTTT 
CGCTGGCACACCAGGCTACCTGTCCCCTGAGGTCCTTCGCAAAGAGGCGTATGGCAAGCC 

[T,C] 

GTGGACATCTGGGCATGTGGTGAGGCCTGGCCTGAGTTGGTGCGGGGCAGGGCCTCGGGT 
GTTTCAGGACTTCCCACCTACATCCTGGAGTGTGCAGTGGCCAGCACGTCTTGCTCTCAT 
CTGGGTTTATCTGTGTCAGACCTGCCCTTGAGCTGCCCTGGCAGGGGTCTGCCCACACAG 
CCAAGAGCCCCCTTTCCACCCAGATTAGAATTGCTCACATGAACCTGGCGCACCCCAGTG 
CTCGCCTGCGCTCAGCAGAGGTCTGGTCCAGAAGTGTGGTGGGTGGATGGGAGTGGAGAA 

GAATTCTTGCCCCTGCCTGAGAGGGAGCTTCAGGCCCGGCCGGGGCGCTGTTTCCTTCTG 
CAGTTCCCGTCCCCTGAGTGGGACACCGTCACTCCTGAAGCCAAAAACCTCATCAACCAG 
ATGCTGACCATCAACCCTGCCAAGCGCATCACAGCCCATGAGGCCCTGAAGCACCCGTGG 
GTCTGCGTGAGTCGCCCTTGGTGCCCATGGTGGGGAGGGGGCTCCTGGTGGAGATGGCCT 
CAGACCACTCCCCTGGCAAGGACCCCAAGAGGGTCCTGTTCCTGACATCCAAGAGCTCCC 

[T,C] 

TGGGTCCCCTGGGTGCTCCTTGTGGCCTCTGGCTTGGGACATACCAGCACGTTTGTGAGG 
CCTGGGGCTTGGAAGGCATTAGAGGGTAGAGGTGATCCCTTCCTCCCAACTGCAGTCCTG 
TCTGTGAGGGGCAGAGTGGACGAGGCAAGGGAGAGACGAGTCTTGAAGTCCCAGGCGGGT 
GGGGACAGACAACCCTTGCCGCAATGGTGGCCGGTGGCTCTTGGCAAGTGGGGACCCCAG 
GGTGCCACAAGCCTTGCCACCCTGGCCTCTCCCCTGTGCCTCGGGCTCGGCTGCCATATG 

CTGACCATCAACCCTGCCAAGCGCATCACAGCCCATGAGGCCCTGAAGCACCCGTGGGTC 
TGCGTGAGTCGCCCTTGGTGCCCATGGTGGGGAGGGGGCTCCTGGTGGAGATGGCCTCAG 
ACCACTCCCCTGGCAAGGACCCCAAGAGGGTCCTGTTCCTGACATCCAAGAGCTCCCTTG 
GGTCCCCTGGGTGCTCCTTGTGGCCTCTGGCTTGGGACATACCAGCACGTTTGTGAGGCC 
TGGGGCTTGGAAGGCATTAGAGGGTAGAGGTGATCCCTTCCTCCCAACTGCAGTCCTGTC 

[T,A] 

GTGAGGGGCAGAGTGGACGAGGCAAGGGAGAGACGAGTCTTGAAGTCCCAGGCGGGTGGG 
GACAGACAACCCTTGCCGCAATGGTGGCCGGTGGCTCTTGGCAAGTGGGGACCCCAGGGT 
GCCACAAGCCTTGCCACCCTGGCCTCTCCCCTGTGCCTCGGGCTCGGCTGCCATATGACC 
ACCCATTTCCCCACAGCAACGCTCCACGGTAGCATCCATGATGCACAGACAGGAGACTGT 
GGAGTGTCTGAAAAAGTTCAATGCCAGGAGAAAGCTCAAGGTGAGGCCCTGGCCCCTAGT 

GTGGAGATGGCCTCAGACCACTCCCCTGGCAAGGACCCCAAGAGGGTCCTGTTCCTGACA 
TCCAAGAGCTCCCTTGGGTCCCCTGGGTGCTCCTTGTGGCCTCTGGCTTGGGACATACCA 
GCACGTTTGTGAGGCCTGGGGCTTGGAAGGCATTAGAGGGTAGAGGTGATCCCTTCCTCC 
CAACTGCAGTCCTGTCTGTGAGGGGCAGAGTGGACGAGGCAAGGGAGAGACGAGTCTTGA
AGTCCCAGGCGGGTGGGGACAGACAACCCTTGCCGCAATGGTGGCCGGTGGCTCTTGGCA 

[A,T]

GTGGGGACCCCAGGGTGCCACAAGCCTTGCCACCCTGGCCTCTCCCCTGTGCCTCGGGCT 
CGGCTGCCATATGACCACCCATTTCCCCACAGCAACGCTCCACGGTAGCATCCATGATGC 
ACAGACAGGAGACTGTGGAGTGTCTGAAAAAGTTCAATGCCAGGAGAAAGCTCAAGGTGA 
GGCCCTGGCCCCTAGTCCCAGGCACGGCCATGCTTCTCTGTGTCCCTCTGGGCTGGAGCA 
GGGGGGCCTTGGGGGGTCTGGGCAGACCTAGGGGTTACTGCTGCCCCCAAGACTGACTGT 

> 4 1 6 TCTGGGCTGGAGCAGGGGGGCCTTGGGGGGTCTGGGCAGACCTAGGGGTTACTGCTGCCC 
CCAAGACTGACTGTTAGCAAGTCCCAGACTGGATGCATCAGGTGAACTCAGGCCAGCTTG 
GGAATGAGTCCAGAGGGGCCCTGGGCCAGGTGTGGCTCCTCCTAGTTGTCTGTGCCACCT 
CCTAGCAGCCCTTGGAGGAGCTGTCCTGAAGCGCTCGCTGTGGGCTCCTCACCCGGGCTC 
TGCAGGCAGCACTCACCCTCTGGCAGTCACACTGTTTAGTACAAGCAAGTCCGAAGCTTC 

[C,T] 

GGCTCAGACAGGTTTGGTAAGGAGAGCAGAGCCACACACACTGGTCTTGGGTGGGCTGGG 
GGAGTTCTGGGAGGGAGGTGGGTCCCAGTAGGGTATCCAACCTGCCTGCTTTGGTCAGGG 
CTGGCTCCGGTGACCGCACACTGGCAGTCCCTCTACTTGTGGGTTCCGGGATGGGGACTT 
GTTGCCTGACTGCCCTCTGCTGGTCTCTGAGCAGTTCTCCCCGGAAGCCCCAGGACTGTT 
GCCCTGTCTGAGCCTGTCAGGAAAAGAAGGGGCTGTCAGGGAGCTGGACCCCAGAGGAGC 

4698 GCTAGGTGGCCCCTGGGCTACACCAAGCCCTTCTGGTCCTGGCCCCCGAGGTCTGGGGGT
CCGGAGACCCCCATTAAGAATGGCCTGGGCCCCACAGGGAGCCACTGGGCCTGCTGCTGG 
GGGGTCTGAATCCTGAAAGGAGAGCCTTGAGGAGCAGAGCCAGAGAGGCAGAGGCCCTTG 
GGGCAGACACACACCCTGCCCCTCTGGGGCCGCATGGAGACGGTGGTCTGTGCTGCTGAG 
TCCTACACATGCATGTCTGCCCTGAGCATCCCCCCAGGACAAGCCGCTCTGGAGTGGGTG 

[A,G]

GGGTTTTATGCACCCTGAGGAGACTTTCAAGGCTTCCTCTTGGGTTGTTTCTGCAAAGTC 
CTCCTCCCCTGGCCTCAAACCCTGTGAGGGAAAAGGCCGGCACTGGCCACCTGCTCCTCT 
GGGCTGTGCGGGGCCAGAGCCCAGAGGCCCAAGTTGGCTTCTGCCCACCTGCTGGCTTGT 
GACCAT 

5424 CCTCCTCATGACCCACAGGGTGAGCAGCCTGGCCTTCCCAGCCAGAGAACCCTCCTTCTG 
GGGAGGCCCAGGGCGTCCTCGGGGAGGGCAGTCTATTCTCCTCCCATGAGCCCAGTGGAC 
GTGTCTAGCAGGCAGCACCCCGGGAGAGCCCTCCCACGTCTTCTCCATTTGACAGGCCTT 
TCCAGAGCGCAGGCGGGAGGGGGCTGTGATTAGAAAAGAGTGAGGCTAGTGGCTTCTGGG 
GAGGCACTGCTGCCCAGGGGACAGTGCTGAGAGACAGCTGCCTCTACGCTGCCCTGTGCC 

[C,T] 

GGGGCTCCCGCTGCAATGCCCGCCTGTCTGCAAGTGAACGTGGGGCGACGGTGCATGAGG 
CCCTGCATGTGTGGCTCCACCCTGGGCGCCGAGAGCAGCTCTGTCCTGGAGGGTGGTCAG 
TGCATGTGGACAGAGCCCAGCATGGCTGTCCTGGGTGACCAGCTAAGGGGACAAGGCAGA 
GGCAGGGCTGAGAGGACCACCCATCCTGCTAGGTCAGCCCAGCTCAGCCATATCACACGG 
CAGTGAGCATGGAGCTCAGTTCTCTGCCAATGGCAGCTGAGTCTAGTACCATCCAGTCAG 

8722 AAGGCCTGTGCTGGCCCCAGTCAGTGCACAGAAGCGGCCCCAAGGCCAGGGCTGCTGGGC 
AGCTCGGAATGAGGGCGAGCAGGGCTGCCCTTGGTGCCTGAGCCAAGGAGCCAATGGGAC 
AGACCTCTGAGCCTGGGTGCCAAGTATGAGGTCTGAGACAGGGTGAGCGCCTGGGCTGGG 
ACAAGGCCCTCTGAGTGGGCGGCCAGCTGCAGCCCACCCACCCCTACCCCAGGAAGGCAG 
GGCCCGGGAGGGCATGACCTCTGGGGTGCTGGCTCAGCTGCCCCCACCCCAACCTGACAC 
[C,A] 

GCTAGTCCTGAGTTCCCATCAGGGAGGAAGCAGCATCCTGCCTTCCTCTAGGAAGAGCTT 
GCATGTGGCCCAGAAGCCAAGGGGGCTCCCCAGCACCCACGGGCATCTCTGGGTCTGGTC 
AGAGGAGAAATCTGGATGCTTGCAGGAGCCCCAGGGTCATGGAGGAGGCTGGAGACAGGG 
CTGTCCTGGGGTGATGGGATGGCCCCCCCACCTGCTCAGAGCCAGCCTGGGTGCTGGAAC 
CACACTTGCCTCAGGACCCTGGGCTTGCTCCTGGGGAAAGAGTGGGGTCAGGCAAAGGGG 

9982 CCAGGAGTGTTCAGGAAGTCAGTGAGGCAGAAGATACCCTCTCCCCACCAGGACCCCACC 
CTCAGCTCCTCCACCATCCTCAACAGGCCGACCCACAGACCACTCCGAAGGTCTGGCTTG 
GTGGGGCTGGGCCAGGATCTGCAGGGGGAACAGCCCATAGTGGCACATTCCACGGCCCAT 
GGGGAGACGGGGCCACGGTGGTGCAGTAGAGAGGTGTCTAAGCCAGTGGCAGCCAAGGGG 
AGGGCTTGCCGTCACCTCTGTGTTCCCTCAGTGCTGCTCTGTGGCTGCCTGAGAGGCAGG 
[G,A]

CTTAGGGGCTCCCTGCCGGGGAGGGGAGGGGTCCCCACCATGCTCCGCTCCAACTGCGCC 
CCTCAGTGCCCCTTGCCCTGGGGGCTCCTACAGGTGAACCCTATAGCAGTACTCCCAAGG
ATGTAAAGTTGTGGCTGGTGGGTGCCGGCCTTCCTGCTGGGGCGCTGTGCTGTGTCCCCT 
CAGCTGTCCTAAGAGCTTTGGGGCTTGCTGGCCCGTAGGTCCCCATATTTGCTGGAAGCA 
GGCTTGGTGTCCCCTGAGAACCCCAGGCCAGGCTTCGGGAGCCAGCCCCAGACCGCCCAC 

10951 ACAGCAGCACCTCCGCCAGCCTGGACAGAGCTCCTGTCCATTCCATCCCTGCCGGCTGAC 
CCAGGCTCCTCCCCCAGCTGCTCCACGCCGCCTCCATCCCTGTCCCCCACTCTGCTCTGC 
ACTTCTTTCTCGCAGGCTCTGGCCACCCACACCTCCTCTGTCTCCCTGTTCCCCTCCTGG 
TGGTCTCCGCTTCCTCCTCTTCTCACTTTCCCTCTCTTTCCTTCCTCTGTGTCTTCCTTC 
TTCTGTAGGAGCCTCAAACCACCGTCATCCATAACCCAGTGGACGGGATTAAGGTACTGC 

[C,T] 

CCACTTTCCTCCTCCCGCTTTCCCCAGGCAGGAGGCTCCAGGCCAGGAGAGAGGTCTGGG 
GCAGCATTTGTGCCAGAGTGGAGGGCAGATGTCCCATGGCCCTGGCCGCCCCTCCCCGCA 
GTACGGTAGGGCCCCAGTCCGTCTTCGTGGGCAACAACAGGACAGACTGGCTCAGGCCCC 
AGGCGCGCCCCTGGAGGTGCTTGGCACAGTTGCGCCCGGTCCCCATGTGGCCGACACTCT 
CAGACCAGGGCTCTGCGTGTCCCACCTACGGCAGGCAGTAGGGCTTCCTGAGGTCTGGAG 

12603 AGTCTCTCTGCCAGGCTCATCTTGCTGGGAGAAGTGGAGCCCTCATGTGTTGGGGATGCA 
GGGTGGCCACAGCACTAGGGTGGCAGGGCCGGCCTCGGACTCCGTGCCAGCCTGTGCTGG 
CTGCCGTGAGAATGCACCCTGGTGAGGGGCGCCCTCCCAGGGACCAGCACAGAACTGGGT 
GTCTTCTCCGGTCACTGCCGCATGAGGTCCACAGAGCTGGGGCCCTGCAGCCGCCAGAGG 
GCATGTCCCCTGAGCCCCTGGCCTTTAAGCCCCGTGGAAGCAGCCGAGGCAGAGATCAGC 

[T,C]

TCAGAGCCTGGGCTGGTCCTGACACAGGCCCAGCCCTGTCCACCTGCCCTCAGCCACGTC 
CCACCTATCCTTGGCCGCATCCTGACCCGCTGCCTCCCGTGTTTCCTCAGGAGTCTTCTG 
ACAGTGCCAATACCACCATAGAGGATGAAGACGCTAAAGGTACCTGCACTTGAGTCCTTG 
CCCCCCCAGCGGCCTTGGCATTGCTGGGTTGCTCTTTGAGGTGGGTGGGACTTGGGCAGG 
GTCAACTCTCCTGCGACGCCTAGTTTATGCATGTGTTGAGGGGCTCAGGGACCCTGTAGC 

14583 ACATCCTGAGCTCAGTGAGGAGGGGCTCGGGAGCCCCAGAAGCCGAGGGGCCCCTGCCCT
GCCCATCTCCGGCTCCCTTTAGCCCCCTGCCAGCCCCATGTAAGTAGCCTGGGTCCTGCT 
GCTGTGGGGGTCATGTTGGAGGGCTGGCAACCCCCTAGAGGGGCCACTCCAGAGCCGAGG 
GCAGGCTGAGCGTGGACCCTGGCTCCAGCCTCATCACCCCACAATCCCTCACTGGGGCTT 
TCCAGGGTGGCCCCAGCCCATCGAGCCCCACCTCTTTGTGAGGAGGGCCCTGGACCACTT 

[C,T] 

CCTGCTCAAGGCCACTGGGCAGGATGGGAGGCCCTGGAGGCTCGGGCCTCAATTCCAGTC 
TTCAGGGTCGGTGCAGGCCTCACTCCACCTCAGCTTGCGGGCGGGGGGGCTCCCTGCTAT 
TGAGGCAGGCTCTGATTCAGGGCCTGATCCCAGGGCCCAAGGGGTCTAGAACACGGGACC 
CCTCCCACTGGCCTCCTCCGCCTTGCCGCCGCCTCGTGTGTCTGTCTGCCTCATGTTCAC 
GTCTCATCTGTTCCACCCCAGCCCCCAGGGATCTCTGACATCCTGAACTCTGTGAGAAGG 

17290 CTGTCCCCTTGTGCCCCATCCCCCACATCTGCCTCTGTGCCCCTCAATCTCTGGCTTGGC
TGTCTGCCCATGGTTTCTCTCCTGCGTGCCCCCCGTGCCTGCCTTGTGTTCACGTCTCGT 
CTGTTCCGCCCCAGCCCCCAGGATCTCTGACATCCTGAACTCTGTGAGGAGGGGCTCAGG 
GACCCCAGAAGCCGAGGGCCCCTCGCCAGTGGGGCCCCCGCCCTGCCCATCTCCGACTAT 
CCCTGGCCCCCTGCCCACCCCATGTAAGTAGCACCTTGAGTGGCCGTGGCAGCGGCTGCC 

[T,C] 

GGAGGGGCTCGGGGCGTGCGAGCCTGGCAGTGGTGCTCTGGGAAGGGCCATTCTTGCGGA 
GGAGGGCGGGGCACAGGATCCCTCTGCTGGGTCCCAGGGAATTGCTTTGAAGCACATGAA 
GGTGCCACTGGGTCTCAGAAAATGGAGGTTATGGTTATGAAGTGTGTATGACATATGTGT 
ATAGGAAGAGCGTCCGAAAGAGCAGGTTTGTTGCCGACCCCAGCATTCGCAACCCTGAGG 
TCCACAGCTTTCTCCTGATGGGAGGGGAATGGGTGGCAAAGGGTCTGCGCGTGTGGCAAG 

18188 ATCCCAGGGCTGCTGCCACCCCCACCTGTGGGGAGACACCAGACTGGGGGTGGTGTGGAG 
ATACTCTTAGAGAAGAGGCTGCTGGGCCACGGGCTCGGCATGGCAGGGCAGTGGCTAGGT 
AAGTACTTGAGGGACAGGTGGGGTCTGCTTGCCACCGTCCCCTCTGCAGGCTGGGCCTGG 
GGGCTGCTGCAGGCGGCCAGGGCAGAAGGGTGTGGGGAGAGTGAACCCACAGGAGCAGCG 
GCTCGAGGAGGGGGATGCAGGCTGCAGGCTCAAAGGGGCACTGGATCCACCCTGGGTGCC 

[C,T] 

GAGAGAGCAGGGGGCAGCCCCTGGAGGGGTACTCACCCCCAGAGCTTCTGTGGTCGGCTG 
AGGACCCCCAGCAGGGGTTGACTGAGGGGATCAGAGGCAAGCAGCTGAGGGGAGAGGCCA 
GGTTCTTGATGCTGATAGGGTCGGGGTGCCTGGGCGACCAGAACTCAAGGAGGGAGGCAT 
GGGGAGGGGCCGCCGTGCAGCTGGGGTGGGTGCACCGCAGAGCCTCTGGGAGTGGTCAGA 
ACCCCCGACACCTGCCACTTCTACAGCAGCTCATCTGATTTTAAGGGGCTTGCTGCCCTT

19911 AGCACGGTTACCACTCTTGATTGGAACTCTGACCATGCATCTCCTCTTCTGTTTACTTCA 
CGCTTTCTCTTCCCATCAACTCCCATTTTAATTACAATTTGTTTAAAAGCACTGCATATT 
ACTTCATTAAACAGAAGATTAGTTTCACTTACCATTAGTGTAAGGTGACTATAGAACCAA 
AGCAGACTGGAAACCAAATGACATAATGTCATTCTCTTCTCCATTCCAGCTGCCTGCTGC 
TGTGCGCCTGAGAACCCCTGTGGAGTGGGAGGGGCAGCTGTCTCTGTACATTAGAAAGGG 

[A,G]

GGTTAACTAAGTGACAGGAGGTGTTTGGGACATGTGGACACCAGACTTCTCTCTTGATGC 
AAGGAGGGCAGAGCCAGGCAGCCTAGTGGGGGCTGGCTTGGGGGCTGCTGGAAGGACTGG 
CTACAGGTGGAAGAGAGGTCAGACCTGAAGCTTGGGGCCACCTCCAGGAAAGGACAGGTG 
AAAGTGGAGGCATGAGGCAGGGGAGAGGCAGGTGCCAGGCAGAGGGTGGAGAGGAGGCAG 
GAACATAGCAGCTGGGGCGGGGGCGGGCCCTCAAGTGTCATATGCTACTTTCCTGGGGCC 

21328 GCTGGGCACAGTGGCTCATACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGAT
CACTTGAGGTTAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCTCATCTCCACTA 
AAAATATACACACACAAAAATTAGCTGGGTGTGGTGGTGTGCACCTGTAGTTCCAGCTAC 
TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGTCAGAGACTGCAGTGAGCCGA 
GATCATGTCACTGCACTCCAGCCCGGGTGACAGAGTGAGACTCCATCTAAAAAAAAAAAA 

[C,A,G] 

AATTCCCTCCTCTGGGAATTTAGACCACAGACAGGTTGCATGTATGTGGCCGTTGGAGGC 
AGCACTCACAGCAAAGAGTGGAAACGTCACCACAGGGCCTGCCTTCTGGTGAAAATGGTG 
TCCTGCAGGGCGGGCAGCTGTTTGAGGGCAGGTGTCCCAGGTGCGGCCTGCAGCAGCCTG 
AGGGTCACAGAGCGCAGTGCTGGGAGTGCAGAGACTTCCCCCACAGGGAGAGTTCCCAGG 
AACCTGCTTCCGGTGCACTTCTGGGGGTTTGAGTTTTTTCCACGGACGAATTACTTTGAG 

21391 TTGAGGTTAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCTCATCTCCACTAAAA 
ATATACACACACAAAAATTAGCTGGGTGTGGTGGTGTGCACCTGTAGTTCCAGCTACTCG 
GGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGTCAGAGACTGCAGTGAGCCGAGAT 
CATGTCACTGCACTCCAGCCCGGGTGACAGAGTGAGACTCCATCTAAAAAAAAAAAAGAA 
TTCCCTCCTCTGGGAATTTAGACCACAGACAGGTTGCATGTATGTGGCCGTTGGAGGCAG 

[T,C] 

ACTCACAGCAAAGAGTGGAAACGTCACCACAGGGCCTGCCTTCTGGTGAAAATGGTGTCC 
TGCAGGGCGGGCAGCTGTTTGAGGGCAGGTGTCCCAGGTGCGGCCTGCAGCAGCCTGAGG 
GTCACAGAGCGCAGTGCTGGGAGTGCAGAGACTTCCCCCACAGGGAGAGTTCCCAGGAAC 
CTGCTTCCGGTGCACTTCTGGGGGTTTGAGTTTTTTCCACGGACGAATTACTTTGAGAAA 
CCACTGTTACTCGTGTGTATAGGTGAGCGTGCGTGTGCATGTGTGTTCTGTGTGTGAGTG 

22588 GCTGCTTCCTCCTCCCCGGCCTCCGGGTGGCCTTGCTGACGGCTCCTTCTCTGAGGCAGG
TCTCTGCCTTCTCGCCTGGTGCCTGCACTCAGTAGCCCCCTCACCAGAGCTGCTGGGTGA 
AGGAAGCACTAAGAACCCAAGGCTCGGGAGGAGAGTGGGGCCGGGAAGCTGCAGGGAAGC 
GCAGGGCCAGGCCTGGTGGGCCCAGGGGCTGGCTCACGGGAGGGCAGGAGGGAGACTGTG 
GCGGACAGCACGTGGGGCCAGGAGGTGACCTCCAAGTGGATTGTGGGTGGGTTTTTTGTC 

[C,T] 

TCTTTCTGCATTTTCCAGGCATTTTGTAATGTGGATAGAATATTTCTGTTCTTCAAAAAT 
ACTTTAGTTAAGAAAAATAAGATGGAAGCTGTTGCACTTGAAAATGAGGAAGCCACTGGT 
GATGCAGGGGGGGCGGCGGAGAGGACCTCTTCTGCAAATAGCGGCAGGAACACGGCATGG 
ATGCAGCTCGCGCTCCCCCAGGCCCTCCCCTGGGCTGTGTGGAGGGGTCCGGGGGGAATG 
GGCCAGCGCCCAGTGGTCACCTGGCCATGTCTCCCCACAGCCCGGAAGCAGGAGATCATT 

22965 ATAAGATGGAAGCTGTTGCACTTGAAAATGAGGAAGCCACTGGTGATGCAGGGGGGGCGG
CGGAGAGGACCTCTTCTGCAAATAGCGGCAGGAACACGGCATGGATGCAGCTCGCGCTCC 
CCCAGGCCCTCCCCTGGGCTGTGTGGAGGGGTCCGGGGGGAATGGGCCAGCGCCCAGTGG 
TCACCTGGCCATGTCTCCCCACAGCCCGGAAGCAGGAGATCATTAAGACCACGGAGCAGC 
TCATCGAGGCCGTCAACAACGGTGACTTTGAGGCCTACGCGTGAGTCCCTGGGGCTGGGG 

[-,G] 

GGGGCTGTGCAGGACAAGGATGTGGGACCCTTGGGGGGGCCTGCTCAGAGTCAGGGGTCC 
ACGGGGCCCCTCCTCACTTGGATTTGGCCCCCAGGAAAATCTGTGACCCAGGGCTGACCT 
CGTTTGAGCCTGAAGCACTGGGCAACCTGGTTGAAGGGATGGACTTCCACAGATTCTACT 
TCGAGAACCGTGAGTGAGGAAGCCCGGGTGGGCATGAGGGGGCGGTGCCCCCAGGAGAGC 
CTCTCGGCCCCTCCCAGGGACAGCATGGTGGCTGCCTATGGAAGCCCTGTCCCCTCTGTG 

23498 CCCGCCAGAGGCCATACCCAGCCCCCAGAATCCCACTCTTGGAGGGGCCCATGCTGCTCC
23663




CACAGTGCTGGCCAAGAACAGCAAGCC 
?SSScGCCTGTGOCCCCOCTOCA<= 



CKASGGGSOCaaMGCTaCaGTGOTMCCMCCl*^^ TOTGTGCGTGTGAOT3 
27727 GCAAACTCTTAGGTTGGAGTAA^^^ 

27 CCACC<^CCACCTATGCTGCCTGGCCCCATGGGGCA TGCCCTCC TCCCTCCC 

GCAACTGCACCTGCCTGTGCTATG^^ 

SCTTTCCTAAACAGAAGGATGGGCTT^ 
S^GGGTCCAAGTAACT^^ 

ACTATGGGCAAGGAGCCTGAAGG

27834 COUCC*.^^ 

278 CCTCCTCCCTCCCCGACGCACCCTCGTG^ 
TCTCCATGGCAACCTGCTC^ 

CCA( 



CTCAGTCCTATCCACTATGGGCAAGGAGCCTGAAGGATCCCCCAGAAC
CTTGCTCCCCATGGTCTGCACCTGGACAGGTCCTCTC ACGAGTGCTGCAG TGTCTG 



CCTGTACTGGGCCAAGCGCTACAI 
[G,A] 

CTTTTATTTTTATATATTTTTGAGAGGGAGCCTCACTCTGTCGCCCAGGCTGGAGTGCAG
TGGCGCGATCTTGGCTCACTGCAACTTCTCCCTCCTGGGTTC